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Description 

Field of the invention 

5 The invention is in the field of recombinant DNA. More in particular, it is related to modified plants, processes for 

the site-directed modification of the genome of plants, and DNA constructs used therein. 

Background of the invention 

10 During the recent years, techniques have been developed for the genetic manipulation of plant cells and the re- 

generation of these plant cells into transgenic plants. On the one hand, direct DNA transformation of plant protoplasts 
may be used for the introduction of the desired DNA into plant cells. For this purpose, several methods are available, 
e.g. Ca/PEG (Krens et aj., 1982; Negrutiu et a], 1987), electroporation and microinjection (Crossway et al., 1986). 
Using the recently developed microprojectile method (Klein et a], 1987) also intact plant tissues may be transformed 

15 with 'naked' DNA. On the other hand, the desired DNA may be introduced into the plant cell using the natural DNA 
transfer system of Aqrobacterium tumefaciens and Agrobacterium rhizogenes bacteria (for review, see Klee et aj., 
1987). 

Agrobacterium tumefaciens and Agrobacterium rhizogenes , after attachment to the plant cell wall, are capable of 
transferring a piece of DNA to the plant cell. Such a piece, the transfer-DNA (T-DNA), is as T-region part of a large 
20 plasmid (1 90-240 kbp) in the bacterium, which is called the Ti-plasmid in the case of A.tumefaciens and Ri-plasmid in 
the case of A. rhizogenes . The T-DNA becomes integrated into the nuclear genome of the plant cell (Tomashow et aj., 
1 980; Chilton et aj., 1 982). Genes residing in the T-DNA are expressed in the plant cell and cause the latter to behave 
as a tumor cell (Ooms eta]., 1981; Willmitzer et aj., 1982a+b). 

In addition to the genes that are responsible for tumor induction also genes are present on the T-DNA which take care 

25 of the production of so-called opines. Opines, like octopine and nopaline, may serve as energy, nitrogen and/or carbon 
source to Agrobacterium . The enzymes that are needed for the catabolism of these opines are encoded by genes that 
reside on the Ti- (Ri-) plasmids (e.g. Bomhoff et aj., 1976; Kerr and Roberts, 1976; Hooykaas et al., 1977). Depending 
on the opine production, the Ti- and Ri-plasmids are classified into groups (for example octopine or nopaline plasmids). 
The T-region is confined by two imperfect direct repeats of 25 base pairs, also called 'borders' (Yadav et al., 1982; 

30 Zambryski et aj. , 1982; Gielenetal., 1984; Slightom et aj., 1985). The presence of these borders in cis is a prerequisite 
for correct transfer of T-DNA (Wang et al., 1 984; Peralta and Ream, 1985). 

The presence of the right border is necessary for the efficient T-DNA transfer (Ooms et aj., 1982; Shaw et al ., 
1 984b; Wang et aj. , 1 984). Depending on the test system it was found that deletion of the left border in some experiments 
does (Bakkeren et al., 1989) whereas in other experiments does not lead to a lower frequency of T-DNA transfer to 

35 the plant cell (Hille et aj., 1983a; Joos et aj., 1983). Next to the right border a sequence is present that significantly 
increases the efficiency of T-DNA transfer (Peralta et al., 1986; Van Haaren et al., 1986, 1987; Wang et aj., 1987). The 
action of this 'enhancer' element is independent on position or orientation with respect to the right border (Van Haaren 
et aj, 1986). From experiments with synthetic borders it appeared that the right and left border sequences are inter- 
changeable and, consequently, the 'enhancer' determines which border sequence becomes the dominant right border 

40 (Peralta et aj. , 1 986; Van Haaren et aj. , 1 987). 

In addition to the T-DNA, there are virulence genes that on the one hand reside on the chromosome, on the other hand 
on the Ti-plasmid (Vir-region). These genes are involved in attachment of the bacterium to the plant cell and in the 
transfer process of the T-DNA to the plant cell (for review see Melchers and Hooykaas, 1 987). 

All the gene transfer systems mentioned above have in common the disadvantage that the site of integration of 

45 the transforming DNA is unpredictable. Thus, as with the other plant transformation techniques mentioned above, the 
DNA that is introduced into the plant cell via Agrobacterium appears to become integrated at random locations in the 
genome (Chyi et al ., 1986; Wallroth et aj., 1986; Spielman and Simpson, 1986). In certain situations, however, it is 
desirable or even necessary to determine the site of integration beforehand. Thus, the gene to be introduced might be 
targeted to a location where the desired regulation of expression is guaranteed. Also the newly introduced DNA could 

50 be used to mutate or inactivate a specific plant gene. 

Several methods have been described to integrate DNA sequences into the plant genome in a site-specific manner. 
These methods are all based on a mechanism known as homologous recombination. 

Homologous recombination is a process that occurs very efficiently within bacteria and yeasts. In these organisms 
it is used for site-directed integration of newly introduced DNA (Ruvkun and Ausubel, 1981; Orr-Weaver et aj., 1981). 

55 in yeast it was found that DNA molecules, linearized in the area of homology with DNA integrated into the genome, 
recombine with a 10-1000 fold higher frequency. More recently, also in mammalian cells homologous recombination 
between genomic and newly introduced DNA was found to occur (Smithies et al., 1 985; Thomas and Capecchi, 1 987; 
Song et aj., 1987; Baker et aj., 1988; for recent review see Capecchi, 1989). Also in these systems it appeared that 
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upon co-transformation of two defective mutants linearisation of one of the mutants in the region of homology resulted 
in - on an average - a 10-fold higher recombination frequency (Kucherlapati et aL, 1984). 

Recombination between two homologous DNA molecules, after their simultaneous introduction into a plant cell, 
has been reported by Wirtz et aL, 1987. 

European patent application (EP-A-0 317 509) discloses a method for the integration of DNA sequences into the ge- 
nome of plants through homologous recombination. According to the application the introduction of the DNA construct 
into the plant host may occur by known techniques, such as the Agrobacterium transfer system. In the Examples, a 
direct DNA transformation method (with "naked" DNA) was actually used to introduce the incoming DNA into polyeth- 
yleneglycol (PEG) treated tobacco protoplasts. 

It was stated that modifications on exactly defined locations in the plant genome could be obtained. However, the 
results of the experiments, using different defective APH 1 1 genes, conferring kanamycin resistance, were not conclusive 
as to whether restoration of the gene occurred on the desired locus ("in situ "). In a later published article on the same 
experiments by one of the inventors, Paszkowski et al. (1988), it was only assumed that restoration of the defective 
APHII gene, due to homologous recombination with the incoming defective APHII gene, could have occurred on locus, 
"but further evidence to confirm this was required". 

There is still a need for an efficient method for in situ modification of the plant genome and selection of the desired 
mutants. 

Summary of the invention 

The present invention provides a method for introducing a defined mutation in a selected target locus of a nuclear 
plant genome by integrating at least a part of a recombinant DNA into said genome through homologous recombination 
at the target locus, which recombinant DNA has the following general structure, 



in which the boxes 1 and 7 and the connecting lines represent DNA sequences that are capable of functioning as a T- 
DNA border in the DNA transfer process, 

in which box 1 or 7 may be absent but not both, in which box 3 comprises a DNA sequence which is sufficiently 
homologous to a DNA sequence inside the target locus and sufficiently long to promote homologous recombination, 
in which box 4 represents a DNA sequence that is not homologous to sequences occurring in the target locus, and in 
which the lines connecting the boxes may represent any number of basepairs, or no basepairs, 
and identifying transformants having the defined mutation in said target locus using known methods. 

The invention also provides recombinant DNA capable of integrating a part of itself into the genome of a plant host 
via homologous recombination at the target locus, which recombinant DNA has the following general structure with at 
least the following requirements : 



in which the boxes 1 and 7 represent DNA sequences that are capable of functioning as a T-DNA border in the DNA 
transfer process. 

in which box 1 or 7 may be absent but not both, 

in which both box 3 and 4 represent DNA sequences which are sufficiently homologous and sufficiently long to promote 
homologous recombination with the target locus, 

in which the DNA sequences within the boxes have the same 5' to 3' orientation as in the target locus, but the order 
of the boxes themselves have been changed with respect to the situation in the target locus, resulting in insertion of 
box i in the target locus after homologous recombination, in which the lines connecting the boxes may represent any 
number of basepairs, or no basepairs. 

The invention further provides DNA-constructs that additionally enable selection of those plants that contain the 
defined mutations on the desired genomic location. These constructs are especially useful if the integrated mutation 
is phenotypically difficult to detect. The invention also provides methods to integrate DNA sequences containing defined 
mutations on desired locations of the plant genome, by introducing these T-DNA constructs to the plant cell using the 
DNA-transfer system of Agrobacterium tumefaciens , or species related thereto. Also vectors are provided, containing 
said recombinant T-DNA constructs, as well as bacteria transformed therewith. 
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Detailed description 

It was surprisingly found that two T-DNAs, containing regions of homology, which are simultaneously introduced 
into a plant cell, are physically capable of homologous recombination. Moreover, newly introduced T-DNA, hereinafter 

s referred to as the 'targeting construct', containing regions of homology with the plant genome of the acceptor plant cell 
was also found to be capable of recombination with the genomic DNA sequence, within the regions of homology. In 
some cases the restored phenotype, viz. kanamycin resistance, was shown to be the result of restoration of the mutated 
locus, i.e. the plant had been modified jn situ, on the desired locus in the genome. The desired locus will also be 
indicated hereafter as 'target locus'. 

10 The recombined sequences in the plant genome appeared stable and genetically inheritable. 

The frequencies of recombination (compared with the number of transformants) found by us for the Agrobacterium 
system are comparable with the frequencies that were found by Wirtz et a]., (1987) and by Paszkowski et aj., (1988) 
for the direct DNA-transformation method, using naked DNA. 

The possibility to use the Agrobacterium transfer system to enforce homologous recombination in plant cells was quite 
is unexpected, in the light of the structural properties of T-DNA, and the involvement of proteins in the transfer proces. 
This is illustrated by the following findings; 

a) using the Agrobacterium transfer system, the transforming DNA (T-DNA) is excised from the transforming plas- 
mid at the T-DNA borders, and consequently, is not linearized within a region of homology with the target locus, 

20 as is the case with all naked DNA, 

b) unlike with naked DNA, the T-DNA molecule was not expected to be available for homologous recombination 
due to its single-stranded nature and as a consequence of the association with virulence proteins such as Vir E 
(Gietletaj., 1987; Das, 1988; Sen eta]., 1989; Citovsky etaj., 1988, 1989; Christie eta)., 1988) and Vir D2. VirD2 
is even known to be covalently attached to the 5'end of the so-called T-strand (Young and Nester, 1 988; Herrera- 

25 Estrella et aj., 1988; Ward and Barnes, 1988). A model is described in which the T-DNA is transferred to the plant 

cell as a single stranded linear molecule (Stachel et a|., 1987; Albright etaj., 1987) and in which single stranded- 
DNA binding proteins possibly protect the DNA from nuclease activity before integration into the genome (Gietl et 
al., 1987; Citovsky etaj., 1988, 1989; Das etaj., 1988; Sen et al., 1989; Christie et al., 1988; Young and Nester, 
1988; Herrera-Estrella et al., 1988; Ward and Barnes, 1988). 

30 c) the very high background that is to be expected, due to the very efficient random integration of T-DNA; this 

requires a rigorous selection mechanism to find plant hosts with the proper integration (i.e. via homologous re- 
combination) jn situ, while discarding the non-transformed hosts, as well as the overwhelming majority of the plant 
hosts having undesired random integrations of the entire T-DNA in their genome. Especially, when the desired 
mutations to be integrated into the plant genome are difficult to observe immediately (i.e. that have a phenotype 

35 that can not easily be observed or screened for), this poses a serious problem. In fact many jn situ modifications 

are difficult to detect. 

Although it was mentioned that Agrobacterium can be used as DNA-transfer system to direct homologous recom- 
bination, until now, no results had been reported, which is supporting the above mentioned doubts with respect to the 
40 suitability of the Agrobacterium DNA-transfer system for the site directed mutagenesis of the plant genome. 

Some of the advantages of the use of Agrobacterium as a DNA-transfer system to enforce site-directed mutagenesis 
of the plant genome over the use of naked DNA transformation, are, among others: 

1 ) Protoplasts are transformed by Agrobacterium with a considerably higher frequency than is possible when using 
45 naked DNA transformation. From calli that were regenerated from protoplasts of Nicotiana tabacum cv. petit havana 

SR1 , cocultivated with Agrobacterium during 72 hours, 20-50% appeared to be transformed (van den Elzen et aj., 
1985a; Depicker et aj., 1985). Even if the recombination frequency with T-DNA would be lower than the recombi- 
nation frequency with naked DNA, the percentage of cells that is transformed by homologous recombination could 
be higher due to a higher transformation frequency using the Agrobacterium DNA-transfer system. 

50 

2) Regeneration of protoplasts into plants is still a problem for many plant species. This limits the use of 'naked' 
DNA-transformation, with which it is necessary to work with protoplasts (except for the particle bombardment 
technique, which may use regenerable plant parts, however this technique causes a lot of scrambling, and for this 
reason appears not very useful for this purpose). Furthermore, regeneration of protoplasts into plants proceeds 

55 through a callus phase. During such a phase somaclonal variation is often observed. Somaclonal variation includes 

all chromosomal rearrangements in mitotically dividing tissue and results in chimaeric tissue. To obtain transformed 
plants via Agrobacterium, no regeneration of protoplasts is needed. Easily regenerable tissues of the plant like e. 
g. leaf discs (Horsch et al., 1985), potato tuber discs (Sheerman and Bevan, 1988) or meristems (Ulian et aj., 
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1988) can be used to provide transformed plants after cocultivation with Agrobacterium . Shoots can also be re- 
generated with reasonable frequency from leaf discs of tobacco plants that have been inoculated with A.rhizogenes . 
Easily regenerable plant tissue, that can be readily transformed using Agrobacterium . can now be obtained from 
many other plant species as well. 

5 

3) In contrast with integration of naked DNA, the integration of the T-DNA is precise. With this is meant that T-DNA 
copies often integrate intact and that 'scrambling' hardly occurs (Hain et al., 1 985; Czernilofsky et a)., 1 986; Deroles 
and Gardner, 1 988). Wirtz et aj. (1 987) and Paszkowski et aj (1 988) find rather complex integration patterns in the 
plant genome, in their experiments, where they use naked DNA-transformation to examine homologous recombi- 

10 nation. 

4) Some naked DNA-transformation methods require so-called carrier-DNA to increase the transformation fre- 
quency. This carrier DNA might interfere with the homologous recombination event. Besides, this carrier DNA 
becomes more or less randomly integrated into the plant genome (Peerbolte et aj., 1985) and, consequently, 

is causes undesired mutations in the genome of the host. The Agrobacterium system does not require carrier-DNA. 

The attention for the Agrobacterium system further increased after it appeared that in addition to dicotyledonous 
plants also monocots - among which bulbous plants, asparagus and cereals - can be transformed using Agrobacterium 
(Hooykaas-van Slogteren et al., 1984; Hernalsteens et aj., 1984; Graves and Goldman, 1986, 1987; G rims ley etaj., 
20 1 987). 

In general it was found that two T-DNAs are capable of homologous recombination within the plant cell. In particular 
this was shown for a model gene namely the NPTII-gene, conferring kanamycin resistance (Km r ). The NPTII gene is 
derived from the bacterial transposon Tn5 (Becket aj., 1 982) and codes for the enzyme neomycin phosphotransferase. 
When the NPTII gene is placed under regulatory signals derived from plant genes (or Agrobacterium T-DNA genes), 

25 it can be expressed after introduction in the plant cell and provide resistance to the antibiotic kanamycin (Bevan et aj., 
19B3; Herrera-Estrella et al., 1983). Plant cells are sensitive to kanamycin. Therefore, the gene can be used as a 
selectable marker for transformed cells or tissue slices. This Km r gene is chosen as model gene for the detection of 
homologous recombination in plant cells, viz. protoplasts of Nicotiana tabacum cv. petit havana SR1. 
Upon separate transformation of T-DNAs containing a defective NPTII gene, no Km r protoplasts could be obtained. 

30 However, simultaneous introduction of both defective NPTII -genes by cocultivating tobacco protoplasts with two dif- 
ferent Agrobacterium tumefaciens strains, each containing a different defective NPTII-gene in a binary vector, resulted 
in Km 1 " tobacco cells. These cells could be regenerated into Km r calli and subsequently, into Km r plants. Analysis at 
the DNA level confirmed that both defective NPTII-genes complemented each other through homologous recombina- 
tion. Analysis at the protein level also showed the recombination event had occurred at exactly the correct site within 

35 the coding region of the gene. The possibility that the recombination event had taken place in Agrobacterium was 
excluded by control experiments. 

This shows, for the first time, that homologous recombination in a plant cell between two T-DNAs sharing a region 
of homology is possible. 

In another experiment, a transgenic tobacco plant, containing a defective NPTII gene in its genome was transformed 
40 with a repair construct that was introduced into the plant cell using the Agrobacterium DNA-transfer system. The repair 
construct also contained a defective NPTII-gene mutated at a different location within the gene. It was shown that the 
mutation could be restored due to homologous recombination between the newly introduced NPTII mutant in targeting 
construct and the mutant NPTII gene residing in the plant genome. Again, recombination had occurred correctly, re- 
sulting in a restored gene, encoding a fully active NPTII-enzyme, that is identical to the wildtype NPTII product. Most 
45 importantly, analysis at the DNA level revealed that recombination had occurred at the target locus, resulting in resto- 
ration of the gene in situ . 

In other experiments it has been shown that targeting is also possible to sequences naturally occurring in the plant 
genome (endogenous sequences). 

In these experiments an endogenous gene, viz. a member of the rbc SSU multigene family, was selected as target 
50 locus to investigate the possibility to mutate endogenous sequences ]n situ via homologous recombination. Genes of 
this family code for the small subunit of ribulose-1 ,5-biphosphatecarboxylase/ oxygenase (rbcS), a nuclear encoded 
chloroplast localized protein involved in photosynthesis. Expression of the SSU genes is light-dependent. 
Therefore two targeting-constructs were made, each comprising a different translational fusion between the coding 
region of this rbcS gene, consisting of four exons and the coding region of the NPTII gene (see figure 10). Onechimaeric 
55 gene, a translational fusion in the second exon of rbcS, codes for a protein in which the transport peptide (involved in 
targeting the rbc gene product to the chloroplast) and the first 23 amino acids of the mature SSU protein are fused N- 
terminally to NPTII. A similar fusion protein between the transport peptide and the first 23 amino acids of the mature 
protein of the rbcS gene from pea and NPTII has already been described and appeared functional (Schreier et aj., 
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1 985). The other fusion gene codes for a protein in which the transport peptide and 99 amino acids of the mature SSU 
protein of the N-terminus of the NPTII enzyme are fused. Up to 0.01% of the transformed calli appeared km r , which in 
most cases appeared light-regulated, whereby some of the SSU-NPTII fusion protein appeared transported to the 
chloroplast. Analysis of the genomic locus at the DNA level, using Southern blotting and PCR (Polymerase Chain 
5 Reaction) analysis, revealed that the light-regulated Kanamycin resistance of part of the transformed calli, indeed 
resulted from modification of the target locus in situ , i.e. at the selected rbcS locus. 

The results of this experiment prove, that desired mutations can be introduced in desired loci in the plant genome, 
independent of the fact whether the target-locus consists of a DNA sequence that is exogenous or endogenous, via 
homologous recombination, using T-DNA constructs. Furthermore, it nicely illustrates, that any functional part of a 

10 gene or an entire gene, which is functional in combination with a functional part of any other gene residing in a selected 
target locus of the genome of the plant host may be correctly fused to that part in situ , via homologous recombination 
between the targeting construct and the target locus, resulting in a functional fusion of the two functional gene parts. 
In its broadest perspective, the present invention provides a method for manipulating in principle any part of the plant 
genome, without changing the location of that part of the genome. 

'5 in the experiments mentioned above, the homologous recombination events could be readily detected, since the gene 
fragments sharing homology were able to complement each other, thereby conferring kanamycin resistance to the 
host. However, in many cases, the mutation one wants to introduce into the plant genome via homologous recombi- 
nation, can not be selected for directly. For instance mutations such as exchange of amino acids or altering codon 
usage of genes and the like, may not be readily detectable, and thus have to be analysed on the DNA level. Such 

20 analyses may involve, restriction mapping of the genomic DNA, PCR-analysis, and/or DNA-sequence analysis. 

Since, in these cases, it is very awkward, time consuming and expensive to screen all coincubated cells for the desired 
recombination event on the DNA-level, it is desirable to be able to select for transformed hosts, using a positive selection 
marker, such as an antiobiotic gene or a herbicide resistance gene and the like, which have a high probability of being 
transformed through homologous recombination instead of random integration at undesired locations in the genome. 

25 Since the random integration frequency is known to be far greater than the frequency of homologous recombination, 
a strong selection mechanism is required to discard all those cells that have been transformed through random inte- 
gration. Therefore, it is preferred, according to the invention, that a positive selection gene is used in combination with 
a negative selection gene, optionally two negative selection genes, residing in the targeting construct, outside the 
regions involved in the homologous recombination event. The general structure of such a T-DNA construct is outlined 

30 in figure 13A. The boxes 1 and 7 each represent T-DNA borders, boxes 2 and 6 are expression cassettes containing 
a negative selection gene, box 3 and 4 contain sequences used for recombination and mutation of the target locus, 
and box E contains sequences involved in recombination outside the target locus. The negative selection genes serve 
to provide disadvantage (preferably lethality) to the cells that have these genes integrated in their genome in an ex- 
pressionable fashion (e.g. in an expression cassette, containing the structural gene between the regulatory regions 

35 that are required for proper expression in the plant host). Accordingly, all cells having the entire T-DNA randomly 
integrated in the genome will be harmed or killed, due to the presence of the negative selection genes. Only those 
cells will survive that have the positive selection gene integrated in their genome, while the negative selection genes 
have been discarded as a consequence of recombination within the regions of homology. Since scrambling of T-DNA 
is known to occur at a considerably lower rate as compared to naked DNA, the combination of a positive selection 

40 gene inside the regions of homology, and one, optionally two, negative selection genes outside the regions of homology, 
has special advantage if used in combination with the Agrobacterium DNA-transfer system. However, it may very well 
work with naked DNA too. 

Since the property of intact integration (lack of scrambling) will probably be connected with the fact that T-DNA is 
packed with proteins, this system might also work with T-DNA packaged jn vitro , as well as with naked DNA of other 

45 origin than Ti/Ri vectors, that has been packed with DNA-binding proteins. 

In accordance with the invention it is highly preferred if the construct contains two T-DNA borders in their most 
active orientation. However it is known in the art that one T-DNA border is sufficient for DNA transfer, and also borders 
can be used which are synthetic or are integrated in the opposite orientation with repsect to the wildtype situation. 
The choice of the positive selection gene (contained in box 5) is not critical with respect to the invention as long 

so as it is functional in the host, and administered to the plant host in an expressionable fashion. The positive selection 
gene may be chosen from the group including (but not limited to) NPTII-gene (encoding resistance for kanamycin), 
HPT-gene (hygromycin resitance), the ALS gene (chlorsulphuron resistance), DHFR-gene (methothrexate resistance). 
In order to express these genes, strong constitutive promoters may be used, such as the 35S, or the 19S promoter 
derived from the cauliflower mosaic virus (CaMV), T-DNA promoters from Agrobacterium, but also plant promoters, or 

55 any promoter which is functional in the host. In general it is preferred that the DNA-sequences not meant to be involved 
in homologous recombination do not have homology with DNA sequences residing in the genome of the plant. However 
in case this is required, the regions of homology not meant to participate in the recombination event should be kept as 
small as possible, in order not to interfere with the desired recombination event. 
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The choice of the negative selection gene (contained in box 2 and/or box 6) is not critical to the invention as long 
as it is functional in the plant host, and administered in an expressionable fashion. The negative selection gene may 
for instance be chosen from the group consisting of aux-2 gene from the Ti-plasmide of Agrobacterium , the TK-gene 
from SV40, cytochrome f450 from Streptomvces qriseolus , the Adh-gene from Maize or Arabidopsis , but any gene 
5 encoding an enzyme capable of converting harmless substances into harmful substances may be used. 
Parts 3 and 4 of the targeting construct are most critical with respect to the invention. 

In a first preferred embodiment of the invention, mutations are introduced in a selected locus (hereinafter called the 
target locus) requiring that the remainder of the sequence of the target locus (i.e. on both site of the mutation to be 
introduced) must be kept intact. Therefore, the sequence immediately next to the mutation must also be provided in 

10 box 4 of the construct. Thus, mutations may be introduced inside functional genes, or functional parts of genes, such 
as regulatory regions, signal sequence, sequences encoding parts (such as functional domains) of the mature protein, 
or even introns, as well as other functional target loci, not necessarily encoding protein, without changing the sequence 
of the target locus contiguous to the mutation. In this situation both box 3 and box 4 will have homology with the target 
locus, whereby box 3 serves to promote the homologous recombination event, and is therefore indicated hereinafter 

is as the recombination box, whereas box 4 comprises the mutation as well as the sequences of the target locus that 
should not be changed, indicated hereinafter as the complementing box. Since the complementing box can only be 
integrated into the target locus after homologous recombination within the recombination box, the probability of a re- 
combination event within the latter box should be favoured. It is well known that the longer the region of homology the 
larger the probability of recombination in this region. It is therefore preferred, that the recombination box be sufficiently 

20 long to promote homologous recombination, and furthermore, that it be significantly longer than the complementing 
box. It should also be understood that if the recombination box of the construct represents the upstream (5') region, 
the complementing box constitutes the downstream (3') region of the target locus, and vice versa. Furthermore box 3, 
4, and E should have the same 5' to 3' orientation. 

Most importantly, the order of boxes 3, 4, 5, and E can not be changed in this embodiment of the invention, although 

25 the entire fragment may be inserted in the opposite orientation with respect to the T-DNA borders. Likewise, also the 
entire fragment including the negative selection markers may be inverted with respect to the T-DNA borders. 

In the constructs, the complementing box may carry three kinds of mutations, which per definition are located 
immediately next to the recombination box. These mutations comprise insertion of basepairs, which may be from one 
to several thousands of basepairs, a replacement of basepairs, not changing the number of basepairs of the target 

30 locus, or a deletion of basepairs reducing the number of basepairs, which may be 1 basepair or as much as several 
thousands of basepairs of the target locus, or combinations of these. 

It should be understood that the recombination box in the construct does not necessarily have to start at the precise 
beginning of the target locus as it is defined. It might as well begin before or after the start of the target locus. However, 
per definition, it ends exactly at the point where the mutation of the target locus must begin. The complementing box, 

35 per definition, starts exactly with the first nucleotide of the mutation, in the case the mutation is an insertion or a re- 
placement, but does not necessarily have to end at the last nucleotide of the target locus, although also per definition, 
it includes the last nucleotide of the target locus. In the case the mutation constitutes a deletion of basepairs, the 
complementing box begins exactly where the deletion ends. Of course, the complementing box may contain more than 
one mutation, even more than one type of mutation. If, in this case, the number of bases separating the the different 

40 mutations is large with respect to the regions of uninterupted homology, it is preferred that such mutations are integrated 
into the plant genome one after the other, i.e. in different transformation experiments. 

In principle, an entire expressible gene may be inserted into the target locus, which gene may itself provide a selectable 
or screenable trait. In this case box 5, representing an expressible positive selection gene, may be absent in the 
construct. Consequently, the need for a region of homology outside the target locus (box E) is also lost. 

45 in a second preferred embodiment of the present invention, box 4 may represent a sequence that is entirely non- 

homologous with respect to the target locus. Thus, it may be an exogenous sequence with respect to the host, for 
instance a sequence derived from a different variety of the same plant species, a different plant species, an organism 
other than a plant, a synthetic sequence, or a genetically manipulated exogenous sequence, but also a sequence that 
is endogenous to the plant host, derived from a different locus of the host genome. In the latter case the chances of 

50 undesired homologous recombination should be minimized by reducing the length of box 4 with respect to box 3. As 
for the situation mentioned above, box 5 may be absent if after recombination a selectable or screenable trait arises, 
eliminating the need for an additional selection gene and box E. 

In a third embodiment of the invention, the site directed mutagenesis of the genome merely aims at total inactivation 
of functional genomic regions. These functional regions include, but are not limited to, genes, regions involved in 

55 regulation of gene expression, DNA replication, and the like. Inactivation may be achieved by creating deletions inside 
the target locus, replacements (e.g. introduction of stopcodons) or insertions (e.g. causing frameshifts and the like), 
using any of the constructs mentioned above. It will appear to an expert, that in those cases the inactivation of the 
target locus can be selected or screened for directly due to the apparent phenotype of the target locus, there will be 



7 



EP 0 436 007 B1 

no need to provide for positive nor negative selection genes. The construct may just contain box 1 , 3, 4, and 7, in which 
box 4 may contain any kind of mutation that is suitable to inactivate the target locus. 

A somewhat different targeting construct, indicated as •insertion vector' is depicted in figure 1 3B. In addition to the 
construct outlined in figure 13A, this construct may be used to introduce insertions (box I) into the target locus. Here, 
6 Box 3 and 4 represent DNA sequences homologous to the target locus, in which the basepair sequences of the boxes 
still have the same 5' to 3' order as in the target locus, but the entire boxes changed places with respect to the situation 
in the target locus. A functional alternative of the construct depicted in figure 13B is one in which both box 3 and 4 
point to the other side. 

Although targeting of endogenous DNA sequences is exemplified using a target construct in which the promoter 
10 and part of the leader sequence of the small subunit of the ribulosebisphosphate carboxylase (rbcS) gene are comprised 
in the recombination box, it will appear to an expert that in principle any part, whether belonging to the regulatory 
elements of a gene, the coding regions of a gene, or any other sequence of any gene, gene fragment or different DNA- 
sequence, may be used in the recombination box in the targeting construct, in order to promote homologous recom- 
bination. To put it differently, any part of any gene or any other DNA sequence may be mutated in an exactly defined 
is manner, provided the DNA sequence of the regions immediately flanking the site to be mutated is sufficiently known, 
using a construct as depicted in figure 1 3A. The invention is not restricted to loci of which the DNA sequence is known, 
since the availability of new DNA-sequences is just a matter of time. 

Although the invention is exemplified with tobacco protoplasts and plants, also protoplasts from other species can be 
transformed using this method, both monocotyledonous and dicotyledonous, as well as other parts of plants or tissues, 

20 e.g. tuber-discs, leaf-discs, embryos, pollen, meristems, and the like, as long as they can be transformed with the 
Aqrobacterium transfer DNA-system. Plant groups of special interest in the light of this invention include but are not 
limited to the Solanaceae , Leauminosae , Umbelliferae , Cruciferae , Compositae , Alliaceae , Vitaceae , Compositae , As- 
paragaceae , Chenopodiaceae , Liliaceae , Orchideaceae , Theaceae , Coffea , Cucurbitaceae , and the like. 

In principle, the naturally occurring DNA-transfer system of Aqrobacterium could be used to practice the invention, 

25 but the size of the wildtype Ti- and Ri-plasmids hampers the manipulation of the T-DNA using recombinant DNA tech- 
niques. Therefore, modified Aqrobacterium vector systems are used in genetic engineering of plants. The plasmids 
that are subject to genetic manipulation have been trimmed to a more convenient size. With these so-called cointegrate 
vectors, the foreign DNA (whether or not next to one, or between two borders) can, via small plasmids that are capable 
of replicating in E.coli , be brought onto the Ti- or Ri-plasmid by means of homologous recombination (Hille et aj., 1 983b; 

30 Barton and Chilton, 1983; Zambryski et a]., 1983; Deblaere et al ., 1985; Fraley et al., 1985). 

Especially preferred in accordance with the invention is the so-called binary vector system, in which the T-region is 
carried by the binary vector, a plasmid which is capable of replicating in both E.coli and A.tumefaciens , while the Vir- 
region resides on a helper Ti- or Ri-plasmid (De Framond et aj., 1983; Hoekema et al., 1983; Hoekema et al., 1984a). 
The T-region now only contains border sequences between which the genes that are to be transformed can be cloned. 

35 Hereby the enhancer element is present next to the right border. 

Also when the T-DNA is located on the chromosome of Aqrobacterium it can be transferred to the plant cell, provided 
that the virulence genes are present in trans in the same bacterium (Hoekema etaj., 1984a). 

When Ti- or Ri-plasmids are introduced into bacteria that are related to Aqrobacterium , such as Rhizobium (Hooykaas 
et al., 1977) or Phyllobacterium (Van Veen et al., 1988), then the T-region of these plasmids appears still to be trans- 
40 ferred to the plant cell, upon coincubation. 

The system of Agrobacterium -mediated gene-targeting may be applied, among others, for the jn situ modification 
of genes of interest in the field of human nutrition, food-processing, animal-fodder, industrial non-food applications and 
plant genes related to environmental fitness. 

The in situ modifications of genes may affect any aspect of gene functioning, regulation of gene expression, or protein 
45 functioning. 

Affecting gene functioning includes any mechanism of complete inactivation of genes (including members of multigene 
families or other sequences) the expression of which is not desired. Such genes may be encoding key-enzymes in 
metabolic pathways, such as fatty acid, carbohydrate-, or secundary pathways. Inactivation of such genes may cause 
specific alteration of these metabolic routes. Such alterations may be desirable as to inhibit the formation of metabolites 
50 that are unhealthy (such as specific alkaloids, and the like) untasty, or otherwise undesirable. It may be very advanta- 
geous to inactivate genes that are involved in fruit ripening, flowering, pollination, and the like. Also genes may be 
inactivated in plants that are used as raw material in food processing, production of food ingredients, pharmaceuticals, 
or for industrial use and the like. 

The invention is also very useful for the inhibition of the formation of proteins or polypeptides that are themselves 
55 undesired, for instance those that are toxic to humans, domesticated animals and cattle, and the like. 
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regulation of gene expression 

In a slightly different embodiment of the invention genes of interest may not be entirely inactivated, but their reg- 
ulation of expression modified. Such modifications may involve non-modulated stimulation of expression (overexpres- 

5 sion), (partial) inhibition of expression, responsiveness to certain internal (hormones, metabolites) or external (heat, 
cold, drought, light-intensity, day-length, tactile stimuli, chemicals, pathogens) signals. For purposes of simplicity reg- 
ulation of gene expression includes also protein-sorting (transport of proteins to specific compartments of the cell, or 
the extracellular space). It is known that certain DNA sequences are involved in non specific stimulation or inhibition, 
other sequences in responsive stimulation or inhibition of gene expression. Many of such sequences such as regulatory 

io elements (including promoters, transcription and translation enhancers, light-responsive elements, and the like), se- 
quences coding for signal peptides, organelle import domains, transit peptides and the like are known in some detail, 
and many more will come at hand in the near future. It has been established that many of such sequences constitute 
a functional domain of itself (i.e. functioning irresponsive of other functional domains), which creates the possibility to 
combine hitherto nonexisting combinations of domains, thus altering regulation ad libitum. However, it seems that 

15 genomic localisation may play a significant role in the regulation of newly introduced gene-constructs as well, some- 
times completely overruling other factors affecting gene regulation. Sometimes, genes can not be expressed in a 
specific plant host at all. This is one of the major problems in genetic engineering of plants. The present invention may 
help to solve this problem in creating the possibility to target newly introduced gene-constructs to genomic loci that 
have a somehow predictable expression mode. For instance, genes that need overexpression can be targeted now to 

20 genomic loci that are known to be very active. In order to achieve this highly active loci can be chosen such as the loci 
on which some of the rt>cS genes, the chlorophyll a/b binding protein genes, and the like, are located. The DNA- 
sequences of interest may be introduced with or their own regulatory elements, such as promoters, enhancers, tran- 
scription terminators and the like, or fused to the regulatory elements present in the target locus. 

The problem of poor expression also plays a role when one wants to alter the structure of endogenous genes, as 

25 to modify protein structure, or domains involved in routing, without changing the genomic location. Formerly such 
genes were isolated and modified in vitro, and subsequently reintegrated into the plant genome using standard trans- 
formation techniques. By doing so, the altered gene-construct reintegrates randomly into the plant genome, often losing 
its specific regulation. Using methods provided for by the present invention, the gene can now be modified in situ , with 
little chance to abolish its mode of expression. 

30 For the example, the amino acid content of the encoded protein can be changed by insertion, deletion and/or substitution 
of amino acids following replacement of some or all of the basepair sequence encoding for the protein. Genes of 
particular interest are genes coding for abundant proteins like seed storage proteins such as zein, napin, phaseolin, 
storage albumin and the like, of which the nutritional value can be raised by introducing more essential amino acids, 
such as lysine or tryptophane. 

35 Furthermore, the invention can have considerable advantage when used for the in situ modification of proteins involved 
in some form of environmental stress. These genes often have a very intricate mode of gene regulation which one 
might not want to disturb by changing the genomic location of the gene, although it might be advantageous to change 
the protein itself. For instance the mode of action, stability or pathogen range of a particular protein can be changed. 
On the other hand mutations may be introduced into a gene of interest expressly causing altered gene expression at 

40 the protein level. This may be achieved for instance by altering codon usage. These mutations can be introduced using 
techniques very well known in the art. 
The following figures illustrate the invention. 

Figure 1 : 

45 

Construction of plasmids pSDM4 and pSDM7, which contain the intact Km r gene that is functional in plant cells. 
These plasmids served as basis for the construction of the different defective genes (figures 2, 3 and 5). The following 
cloning steps are depicted here: 1) the transfer of the Bglll/Hindl II fragment from pMOGEN24 to pUC12 ( xBam HI xHin- 
dlll), 2) the replacement of the transcription terminator of the nopaline synthase gene (3'NOS) with that of the octopine 
so synthase gene (3'OCS), 3) the introduction of a 8 bp synthetic DNA fragment (linker) containing a Xho l restriction site 
at the Eco RI restriction site and 4) the transfer of the supF containing fragment from plasmid jiVX to the Bam HI site 
of pSDM4. Abbreviations and symbols are explained in the legend. 

Figure 2: 

55 

Construction of the defective Km r genes 5'AI, 5AII and 3'AI starting from plasmid pSDM7. The intact Km r as it 
present on pSDM7 is depicted linear. Beneath it the defective genes are depicted with lines. The lines indicate which 
DNA sequences of the intact Km r gene from pSDM7 are still present in the defective genes. 
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The 5'AI was obtained by replacing the Xma lll/Bcll fragment, after filling in with Klenow-polymerase, with an oligonu- 
cleotide of 10 bp, on which an Eco RI site (EcoRI -linker) is located. Replacement of the the Tthlll.l/Bcjl fragment after 
filling in with Klenowpolymerase, with an EcoRI -I inker (10 bases) resulted in 5* All. In 3'AI the 3' region of the Km r gene 
till the Rsrll site was removed. The supF gene was cloned distal from the mutant Km r gene. Abbreviations and symbols 
5 are explained in the legend. 

Figure 3: 

Cloning of the defective Km r genes (see figure 2) as Xho l/Hindlll fragment into the binary vector pSDM5 (xXhgl x 
10 Hindlll). In this way the defective genes were located next to the HnYgene between the T-DNA border sequences (see 
figure 4). The plasmid pSDM5 was derived from pMOGEN24 by replacing the Sph l fragment of pMOGEN24, after 
blunt-ending with Klenowpolymerase, with a Xho l-linker (10 bp synthetic DNA fragment containing a Xho l restriction 
site). See also legend. 

15 Figure 4: 

Overview of the T-region of plasmid pSDM100 on which the intact Km r gene is located. Beneath it, with black lines 
is indicated which DNA-sequences of the T-region of pSDM100 are also present in the T-region of pSDM102 and in 
that of pSDM104. See also legend. 

20 

Figure 5: 

Construction of the 3* deletion mutants of the Km r gene starting from the plasmid pSDM4. A 75 bp Hindi ll/BamHI 
fragment with synthetic octopine type left border sequences was introduced behind the intact Krrf gene on pSDM4, 

25 resulting in pSDM8. The sequence of the fragment containing the synthetic left border is shown in figure 6. Through 
deletion of an Eco RV /Bam HI- or an Eco RV/Rsrl I -fragment, respectively the plasmids pSDM8* and 3'Alla were ob- 
tained. In this way the intact or defective Km r gene is located between border sequences. In pSDM9 the 110 basepair 
Hindlll/Bqlll fragment of the plasmid pRAL391 2 (Hoekema et aj., 1 985) was placed behind the Km r -gene. This fragment 
contains the wildtype octopine left border sequence. The plasmids pSDM9* and 3'Allb were obtained from pSDM9 by 

30 deleting the Accl/BstEII fragment or the Acc l /Rsr ll fragment, respectively. The T-regions of the plasmids pSDM8*, 
pSDM9*. 3'Alla and 3'Allb were transferred as Eco RI/Hindlll fragment to the binary vector pLM997 (see figure 7), 
resulting in the vetors pSDM200, pSDM210, pSDM201 and pSDM211. 

Figure 6: 

35 

The sequence of the synthetic Hjnd II l/BamH I fragment on which the left T-DNA border of the octopine type is 
located. The recognition sites for different restriction enzymes are indicated above and beneath the sequence. 

Figure 7: 

40 

Construction of plasmid pLM997. Plasmid pLM997 was derived from pMOGEN24 by deletion of the BgJII fragment 
carrying the T-DNA followed by the introducion of a so-called polycloningsite (a synthetic DNA-fragment containing 
several recognition sites for restriction enzymes) in the remaining BgJII restriction site. The Eco RI and Hindlll sites in 
which the EcoRI/Hindlll fragments of plasmids pSDM8*. pSDM9*, 3'Alla and 3'Allb were inserted, are indicated with 
45 arrows. 

Figure 8: 

Illustration of the detection of recombination events via Southern analysis and the polymerase chain reaction 
50 (PCR) . Genomic DNA of Km r plant lines obtained from the cotransformation experiments was digested with Eco RI/ 
Bel l and Hindlll. The fragments that were detected after blotting and hybridization with the 3'OCS- or the NPTII-probe 
are depicted. A 2.1 Kb fragment is indicative for the presence of a repaired NPTII gene. Digestion with Hin dlll only 
generates junction fragments (i.e. fragments that contain the junction between plant DNA and the integrated T-DNA 
copy). The number of Hindlll fragments was used as an indication for the number of T-DNA inserts in the plant genome. 
ss pgr analysis using primers 2 and 3 was performed on plants in which the 2. 1 Kb Eco RI/Bcll fragment was not detected. 
The primers anneal within the regions that are deleted in the defective NPTII genes. Only when a repaired NPTII gene 
is present a 593 bp fragment is amplified. Primers 2 and 3 are indicated in the figure by arrows that point from the 5'- 
to 3'-end. 
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sl_B=synthetic left T-DNA border repeat; B=Bcll; E=EcoRl; H=Hindlll. 
Figure 9: 

5 Detection of recombinants after Aqrobacterium mediated transformation of protoplasts of the target line with the 

T-DNA construct of pSDM101 using the polymerase chain reaction (PCR). The target locus of the plant line 104(.1.6) 
is depicted as are the T-DNA constructs of pSDM100 and pSDM101. The 20 basepair long oligonucleotides 1 and 2 
that were used in the PCR, are indicated with arrows that point from the 5'- to the 3' -end. 

io Figure 10: 

A. The translational SSU-NPTII fusion constructs (see example 9 and 11). The constructs were cloned as Hindi! I 
or Pstl/HindlH fragment into the poly-cloning sites of plC20R, resulting in the plasmids pNTSSI, pNTSS2, pNTSS3 
and pNTSS4. pNTSSI contains the fusion gene that consists of the complete sequence of the SSU clone (Hindi 1 1 

15 fragment) and has the 1690 bp BamHI NPTII module from pSDM53 inserted in the 4th. exon. The coding sequences 
of the SSU gene (exons) are indicated with the boxes numbered 1 to 4. Non-coding regions of the gene such as 
promoter, introns and terminator are indicated by a single line. Construct pNTSS2 was derived from pNTSSI and lacks 
the promoter and part of the coding region up to the Pstl site (P1 ). Plasmid pNTSS3 contains the complete SSU clone 
that has the Bglll /Bam HI NPTII module inserted in the 2nd. exon. pNTSS4 contains the promoterless fusion gene of 

20 pNTSS3. Below the constructs the length of the fusion genes is indicated in basepairs. From the pIC plasmid the fusion 
constructs were excised as Sall /Xho l fragment and cloned in the SaJI/Xhol sites between the borders of the binary 
vector pSDM1 4, resulting in pNTSS1 1 A/B, pNTSSI 2A/B, pNTSS21 A/B and pNTSS22A/B. A/B indicates the orientation 
of the fragment in pSDM14. Only the B orientation is shown here. The construction of pSDM14 is depicted in figure 
1 1 . O.D.= overdrive sequence. 

25 Restriction sites: B1 =Bam HI, B2=Bqlmll,E1=EcoRI, H1=Hpal. H3=Hindlll. K1=Kpnl, P1=Pstl, S1=Sall, X1=Xhol. 

B. The EcoRI /Xma lll fragment that was used to construct pSDM53 was composed of 2 complementary oligonu- 
cleotides (I and II). The sequence of the fragment is shown and restriction sites are indicated above the sequence. 
Below the sequence the amino acids of the translation product are given. The numbers 2 and 3 below the amino acids 
refer to the second and third codon of the NPTII coding sequence. At the bottom of the figure the DNA and amino acid 

30 sequence of the fusions between the coding regions of SSU and NPTII are shown. 

Figure 11 : 

The construction of the binary vector pSDM14. See example 10 for detailed description. See also the legend. 

35 

Figure 12: 

Plasmid pNTSS512 contains an alternative T-DNA construct to target the SSU locus in tobacco cells. The con- 
struction of this plasmid is described in example 12. The boxes 3' and 5* indicate the downstream and upstream non- 
40 coding sequences of the aux-2 gene. The non-coding sequences of the SSU-gene are indicated by a line and the 
coding regions (exons) are depicted by the boxes 1 to 4. See also the legend. Restriction sites: B1=BamHI, B2-Bglll, 
E1=EcoRt, Hl=Hpal, H3=Hindlll, K1=Kpnl, P1=Pstl, S1=Sall, Xl=Xhol. 

Figure 1 3: 

45 

A consensus for T-DNA constructs that can be used to achieve gene targeting. See chapter "detailed description" 
for explanation of the numbered boxes. 

The following Examples only serve to illustrate the invention and do not mean to limit the scope of its applications. 

50 Example 1 

Transformation of tobacco protoplasts by cocultivation with Agrobacterium tumefaciens 

Cocultivation is the plant cell transformation method in which plant protoplasts and Agrobacterium are incubated 
55 together and where during subsequent regeneration from protoplast to callus selection takes place on the transfer of 
T-DNA (Marton etaj., 1979; Fraley et a]., 1984). 

For the experiments described below the following protocol for cocultivation of tobacco protoplasts with A.tumefactens 
was used. Nicotiana tabacum cv. petit havana SR1 plants were axenically grown in Magenta boxes, filled with 50-60 
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ml Daichin-agar (0.6%) solidified MS30-medium (Murashige and Skoog, 1 962; contains 30g sucrose/I). Every 5-8 
weeks apical meristems of the plants were transferred to fresh medium. Protoplasts were prepared from leaves of 5-8 
week old axenically grown tobacco plants by overnight incubation at 26°C in K3 0.4 M sucrose medium (Nagy and 
Maliga, 1 976), 1 % cellulase R10, 0.1 % Macerozyme R1 0 and 0. 1 % MES. The protoplasts were washed one time in 

5 K3 sucrose medium, diluted to 1 X1 0 s cells/ml in K3 medium containing 0.4 M glucose (K 3 G) and distributed in batches 
of 7 ml in 9 cm petridishes. They were incubated overnight in the dark prior to cocultivation with the bacteria. Agrobac- 
terium strains were grown at 29° C in LB medium containing 20 mg/l rifampicin and 50 mg/l kanamycin. End log phase 
cultures were diluted in K 3 G medium and the bacteria were added to the protoplasts at a ratio of approximately 100 
bacteria per protoplast. After three days of cocultivation the protoplasts were embedded in agarose discs by mixing 5 

10 ml protoplasts with 5 ml 0.8 % low melting-type agarose (Sigma) in Sll medium (Muller era/., 1983) containing 0.1 M 
sucrose and 0.2 M mannitol. The bacterial growth was stopped by the addition of cefotaxim and vancomycin to final 
concentrations of 200 mg/l and 100 mg/l, respectively. After 10 days 15 ml Sll medium containing either 50 mg/l kan- 
amycin or 10 mg/l hygromycin was added to the discs. Seven days later 15 ml Sll medium with kanamycin at 100 mg/ 
I and hygromycin at 20 mg/l was added. From this moment on the medium was refreshed weekly by replacing 15 ml 

15 old medium with 15 ml fresh Sill medium (100 - 150 mg/l kanamycin, 20 - 30 mg/l hygromycin). This medium is identical 
to the Sll -medium except for the mannitol concentration which is 0.1 M instead of 0.2 M. The plating efficiency was 
determined by incubation of 1/8 part of an agarose disc on liquid medium without selection. The hormone regime in 
the K3 and Sll media was 1 mg/l NAA, 0.2 mg/l BAP and 0.1 mg/l 2.4D. In the Sill medium 2.4D was omitted. Cefotaxim 
and vancomycin were added to final concentrations of 200 mg/l and 100 mg/l, respectively. 

20 Microcalli were harvested from selective or non selective medium 4 to 5 weeks after embedding the protoplasts 

and were transferred to MS30 medium (Murashige and Skoog, 1 962) containing 3 % sucrose, 1 .0 mg/l NAA and 0.2 
mg/l BAP and solidified with 0.6 % agar (Daichin). Shoots were induced on solid MS15 medium containing 1.5 % 
sucrose, 1.0 mg/l BAP and 0.1 mg/l NAA. Solid medium also contained 100 mg/l cefotaxim and 50 mg/l vancomycin 
and for selection 1 00 mg/l kanamycin or 20 mg/l hygromycin was added. Shoots were tested for Km r or Hm r by allowing 

25 them to root on MS1 5-medium to which 20mg/l hygro. or 100mg/l kana. was added. 

Example 2 

Construction of defective NPTII genes 

30 

Construction of the vectors containing different defective NPTII genes started from the binary vector pMOGEN24 
(see figure 1). This plasmid was derived from the vector pROK1 (Baulcome et al., 1986) and contains between the 
borders of the nopaline Ti- plasmid pTiT37 the genes, functional in plants, for kanamycin resistance (Km r ) and hygro- 
mycin resistance (Hm r ) in opposite orientation. The vector pMOGEN24 is obtained through standard recombinant DNA 
35 techniques (Maniatis et a]., 1982) from pROK1 by cloning the coding region of the E.coli hygromycin phosphotrans- 
ferase (Hpt) gene (Gritz et a}., 1983) as a Bam HI-fragment in the BamHI restriction site in pROK1. Consequently, the 
coding sequence of the HPT gene, from 19 basepairs in front of the translation initiation codon up to 20 basepairs 
behind the trans lationstop codon becomes located between the 35S CamV promoter and the transcription terminator 
of the nopaline synthase gene. 

40 Just in front of the actual translation startcodon (ATG) another ATG codon is present. Because this first codon 

might disturb translation from the actual codon the sequence of this codon was changed into ATA via oligonucleotide 
mutagenesis, a standard recombinant DNA technique. Accordingly, the Bam HI sites on both sides of the HPT fragment 
was deleted by filling in using Klenow polymerase (Maniatis etaj., 1982). 

The Bglll/Hindlll fragment from pMOGEN24 with on it the right border and the Km r gene was transferred to the 

45 plasmid pUC1 2 (Messing, 1983; see figure 1) after it had been digested with BamHI and Hind i II (Maniatis etaj., 1982). 
Subsequently, the transcription termination signal of the nopaline synthase gene was replaced by the termintion signal 
of the octopine synthase gene. This resulted in the plasmid pSDM2 (figure 1) on which are located succesively: 1 ) the 
right border of pTiT37 (RB), 2) the promoter region of the nopaline synthase gene up to the base in front of the ATG 
startcodon (5' NOS, Bevan et al., 1983), 3) the coding region of the NPTII gene derived from Tn5 from the Sau llla site 

50 ±10 basepairs (bp) in front of the ATG start codon till the Pstl site that is located ± 370 bp behind the TGA stop codon 
(Beck et aj., 1982) and 4) a 700 bp Pvull fragment that contains the transcription termination signal of the octopine 
synthase gene (3' OCS, Gielen et aj., 1984). 

To simplify reintroduction of the Km r gene of pSDM2 or defective mutants derived therefrom into the binary vector, a 
Xho l-linker was introduced at the EcoRI -site (figure 1). This results in plasmid pSDM4. A Xho l-site was introduced in 
55 the binary vector pMOGEN24 as well, by replacing a Sph l fragment with a Xho l-linker. Sph l cuts in pMOGEN24 just 
before the right border and within the coding region of the Km r gene. Thus, plasmid pSDM5 was obtained (figure 3). 
Finally, to be able to isolate the T-DNA constructs integrated in the plant genome by recombination via a Lambda vector 
library (Maniatis etaj., 1982) , the so-called supF gene located on a EcoRI fragment derived from plasmid rcVX (Seed, 
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1983) was cloned next to the Km r gene. Using the so-called amber/suppressor system, also used and described by 
Smithies etaj., (1985), the Lambda phage library can be enriched for the fragments that contain this supF gene. The 
Eco RI restriction sites of the supJF fragment were filled in with Klenow polymerase and ligation of BamHI-linkers to the 
fragment was followed by digestion with Bam HI. The Bam HI fragment was ligated into the Bam Hi site of pSDM4, 
5 resulting in plasmid pSDM7 (figure 1). 

The defective Km r genes were derived from pSDM7 (see figure 2). As an illustration the construction of one of the 
defective genes, namely the 5'AII construct, is extensively described below. The construction of other defective KnY 
genes is roughly indicated in figure 2. 

For the construction of 5'AII, pSDM7 was cut with restriction enzymes Tthl 1 1.1 and Xma lll, the ends were made blunt 

10 by filling in with Klenow polymerase and on the site of deletion an Eco RI -I inker (10 bp) was inserted. Due to this 
modification a sequence that codes for an active region of the NPTII enzyme wasdeleted (Beck et a]., 1982). The 
sequence at the site of the mutation was checked using the dideoxy-sequencing method (Sanger et al. , 1977). Like 
the other defective Km r genes 5'AII was cloned as a Xhol/Hindlll fragment into the vector pSDM5 that had been cut 
beforehand with Xho l and Hin dlll. The resulting plasmid was called pSDM102 (figure 3, 4 and 8). 

is The 3'A mutants used in the experiment described in Example 3 are depicted in figure 5. Starting from the intact 

Km r gene located on plasmid pSDM4, two types of constructs were made. One type was obtained by introducing a 75 
bp Hin dlll /Bam HI fragment containing the synthetic octopine left border sequence (figure 6) behind the intact Km* gene 
on pSDM4. This resulted in plasmid pSDM8. Through deletion of a Eco RV /Bam HI- or a EcoRV/Rsrl I -fragment respec- 
tively the plasmids pSDMS* and 3'Alla were obtained. The defective gene (3'Alla) lacks part of the coding region of 

20 the NPTII gene and the transcription termination signal. For the other construct type a Hindlll/Bqlll fragment, containing 
the wildtype octopine left border sequence derived from the plasmid pRAL391 2 (Hoekemaetal., 1985) was transferred 
to pSDM4 that had been digested with Hindlll and Bam I. From the obtained plasmid pSDM9 the Acc l /Rsr ll fragment 
on which part of the coding region of the NPTII gene, the transcription termination signal (3'OCS) and a part of the 
Hindlll/Bglll fragment from pRAL39l2 is located, is deleted. This results in 3'Allb. An intact Km r gene with the same 

25 border fragment behind the 3'OCS was obtained by deleting the Acc l/BstEII fragment (pSDM9*). The restriction site 
for BstEII is located behind the transcription termination signal at the end of the 3'OCS part. The constructs with an 
intact or defective Km r gene between the right and left border (respectively pSDM8*, pSDM9*, 3'Alla and 3'Allb) were 
transferred as Eco RI /Hin dlll fragment to plasmid pLM997 (see figure 7), that was cut beforehand with Eco RI and 
Hindlll. This resulted in the binary vectors pSDM200, pSDM210, pSDM201 andpSDM211 , respectively Binary plasmids 

30 were mobilized by a triparental mating (Ditta etal., 1 980) to a rifampicin resistant (rif r ) Agrobacterium strain, that already 
contained a helper Ti-plasmid without T-DNA (e.g. Hoekema et al., 1983; Deblaere et a|., 1985). Conjugants were 
selected on LB agar medium (Maniatisefa/., 1989) containing 20mg/l rifampicin and 50 mg/l kanamycin. Agrobacterium 
strains were named after the binary plasmid they contain. 

35 Example 3 

Homologous recombination in tobacco protoplasts between two simultaneously introduced T-DNA's 

The possibility of homologous recombination between two T-DNAs in a plant cell was tested by transforming to- 
40 bacco protoplasts with two T-DNAs, by simultaneously cocultivating the tobacco protoplasts with two different Agro- 
bacterium strains. Both strains were derived from the same non-oncogenic helper strain, but harbour a different binary 
vector. The T-DNA of each binary vector contained a different defective NPTII gene, one with a deletion in the 5' part 
of the coding region of the gene (pSDM102, figure 4 and 8) and the other with a deletion in the 3' part of the gene 
(pSDM201, figure 5 and 8). In example 2 the construction of the different defective NPTII genes is described in extenso. 
45 Tobacco protoplasts were cocultivated according to the procedure described in example 1, with the following 

strains: 1) 1,5x10 6 protoplasts with SDM102 alone as a negative control, 2) 1 .5X10 6 protoplasts with SDM201 alone 
or SDM211 alone as a negative control, and 3) 1,5x10 6 protoplasts with both SDM102 and SDM201 or both SDM102 
and SDM211. 

(Co-)transformation frequencies of the various constructs were determined in a smaller experiment. Here, 5x10 s pro- 
50 toplasts were cocultivated with both SDM102 and SDM200 or both SDM102 and SDM210. For strain SDM102 the 
transformation frequency was determined by using the hygromycin resistance gene present on the T-DNA of plasmid 
pSDM102. Strains SDM200 and SDM210 were used to estimate the frequency with which the T-DNA of respectively 
strains SDM201 and SDM211 was transferred to tobacco cells. The T-DNA of the binary vector of strain 
SDM200/SDM210 is similar to that of S DM201 /SDM2 11 except that it contains an intact NPTII gene construct between 
55 right- and left T-DNA border instead of the 3' deleted gene. Approximately 5-7% of the protoplasts regenerated to 
callus. Of these surviving calli ± 20% appeared to be transformed with the 102 construct, whereas for both the construct 
200 (=201) and the construct 210 (=211) transformation percentages were observed of + 15%. Of the calli that had 
already been transformed with one construct (Hm r or KnT) approximately 30% appears to be transformed with the 
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other construct (Hm r or Km^. 

In the cocultivation experiment where protoplasts were co-transformed with a 5'A-construct (102) and a 3'A-construct 
(201 or 21 1 ) restoration was found in 1 -4% of the co-transformed calli. In the negative controls only one Km r calli was 
obtained. This callus did not contain a repaired KaY gene and progeny of this callus did no longer show kanamycine 

5 resistance. A clear difference in transformation frequency between the 201 construct (3'A with synth. LB) or the 211 
construct (3'A with wildtype LB) was not observed. 

The obtained Km r calli were regenerated into plants as described in example 1 . In leaf extracts from these plants 
NPTII activity could be detected, using non-denaturing gels (Piatt and Yang, 1987), at the correct position in the gel. 
The plants were also analysed on the DNA level. Accordingly, proof for the presence of a restored Km r could be pro- 

10 vided. 

Theoretically, the NPTII gene could have been restored via homologous recombination in the bacterial background. 
This could only be possible if transfer of the binary vectors between the two bacterial strains should occur. Crossing 
experiments described in the following example (4) excluded this possibility. 

15 Example 4 

The control crossing-experiment 

To test whether transfer of binary vectors occured between Agrobacterium strains, a donor- and a recipient-strain 

20 were coincubated for 3 days at 28°C. A total of 10 9 bacteria of each strain was mixed and spotted on a nitrocellulose 
filter lying on either solid LB medium or on a layer of tobacco suspension cells that had been plated on solid MS30 
medium containing 0.5 mg/l of the plant hormone 2.4D. In addition similar coincubations were performed in the presence 
of E. co// helper strain RK2013 which is used in triparental matings (Ditta etal., 1980). The donor strain SDM201 is rif 
and contains the binary vector pSDM201 that carries a bacterial gene for kanamycin resistance (Km*). The recipient 

25 strain LBA285 is a spontaneous spectinomycin resistant (spcO derivative of strain LBA202 and does not contain any 
plasmid. LBA285 behaves like a wildtype recipient for Ti-plasmids in conjugation experiments. (Hooykaas etal., 1980). 
If transfer of the binary vector pSDM201 should occur from SDM201 to LBA285, spCKm r colonies would be found on 
selective plates. The bacteria were plated on LB medium containing 250 mg/l spectinomycin and 50 mg/l kanamycin 
after coincubation. Resistant colonies were found at a low frequency (0.8x1 0* 8 ). These were not genuine transconju- 

30 gants, because they were all rif. Indeed, incubation of strain SDM201 alone gave rise to spc r rif r Km r colonies at a 
comparable frequency. From this we concluded that these colonies represent spontaneous spc r derivatives of strain 
SDM201. Transfer of the binary vector did occur when the donor and recipient strain were coincubated together with 
E. colt helper strain RK2013 (Ditta et ai, 1980). This confirmed that genes essential for efficient transfer of binary 
vectors are not present in the strains used in our transformation experiments but have to be provided in trans to obtain 

35 conjugation. When E.co// strain RK2013 was provided as helper, the frequency of transfer after coincubation on MS 
medium in the presence of plant cells (8x10 _7 /recipient) was even lower than when coincubation was performed on 
bacterial (LB) medium (1x10" 3 /recipient). 

Consequently, we can conclude that homologous recombination between the T-DNAs had indeed taken place after 
co- introduction in the plant cell. 

40 

Example 5 

Southern blot analysis of plants derived from Km r -calli 

45 Plant DNA was isolated from not fully expanded leaves of plants in the growth room as described (Mettler, 1987) 

and purified on a CsCI-gradient. The concentration of the obtained DNA suspension was determined by measuring 
the OD 260 . Approximately 10u.g of genomic DNA was used for digestion with restriction enzymes. Following separation 
on a 0.7 % agarose TBE gel (Maniatis et ai, 1989) the DNA was transferred to a Hybond N membrane (Amersham; 
Cat. No. RPN.303N ) by capillary blotting and the membrane was (pre)hybridized according to the Hybond N protocol. 

so Final washing was performed in 0.3xSSC, 0. 1 % SDS at 65°C. DNA probes labelled with [cc^Pj-dCTP (specific activity: 
0.5 - 1x10 9 dpm/jig DNA) were obtained using the mixed primer method (Boehringer Mannheim kit; Cat. No. 10044 
760). The chromosomal DNA isolated from plants regerated from kanamycin resistant calli was analysedaccording to 
the method described above. 

Chromosomal DNA isolated from 2 transgenic plants each containing a single copy of the 102 construct, from a plant 
55 containing the 1 00 construct and from a non-transformed N.tabacum cv. petit havana SR1 were used for reconstruction. 
In figure 8 is depicted which internal bands are to be expected after digestion with EcoRI and Bel l. The 5'deletion 
construct (102) gives an internal band of 1.6 kilobasepairs (Kb), while the intact (correctly repaired) Krrf gene should 
give a band of 2.1 Kb. Digestion with Hindlll only generates so-called junction fragments. When the probe only com- 
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prised the 3'part of the Km r gene (e.g. the 0.7 Kb Pvu ll fragment with the 3'OCS, Gielen et a]., 1983) integrated copies 
of the 3'deletion mutant were not observed on the blot, which simplified the interpretation. 

The expected 2.1 Kb fragment, corresponding with a repaired NPTII gene, was present in almost all Km r plants tested 
(figure 8). In case the 2.1 Kb fragment could not be detected the presence of a repaired gene could be shown by PCR- 
5 analysis using primers 2 and 3 (figure 8). 

Example 6 

Construction of transgenic tobacco plants containing a defective NPTII gene as target-locus for homologous 
10 recombination 

Transgenic tobacco plants were obtained by cocultivating leaf discs of axenically grown tobacco plants (Horsch 
et a]., 1985) with bacterial strain SDM104. After cocultivation, the leaf discs were placed on solid MS30-medium with 
callus inducing hormones (1.0 mg/l NAA and 0.2 mg/l BAP) and the antibiotics cefotaxim (200 mg/l) and vancomycin 
is (1 00 mg/l). After one week the leaf discs were transferred to callus-inducing medium with 20 mg/l hygromycin. Resistant 
calli were cut and transferred to shoot-inducing medium (MS15, Img/I BAP, 0.1 mg/l NAA, 100 mg/l cefotaxim and 50 
mg/l vancomycin). Shoots were cut and tested for Hm r by allowing them to root on hormoneless MS30-medium with 
20 mg/l hygromycin. 

The transgenic plants obtained with the method described above were analysed at the DNA level (see example 5). 
20 Subsequently, plant lines were selected that showed a simple T-DNA integration pattern. Plant line 104(.1.6) trans- 
formed with the 104-construct was used as acceptor plant for the targeting experiments. This line appeared to have 2 
T-DNA copies integrated in inverted orientation at the same position in the plant genome (figure 9). 

Example 7 

25 

Restoration of a defective NPTII gene in a transgenic tobacco plant via homologous recombination using 
Agrobacterium-mediated DNA-transfer 

Protoplasts of transgenic plant 104.1 .6 were cocultivated with Agrobacterium strain SDM101 . Cocultivations were 
30 carried out with approximately 2x1 0 7 protoplasts. To determine the transformation frequency 1x10 s protoplasts were 
cocultivated with Agrobacterium strain SDM1 00. The cocultivation experiments were carried out according to the pro- 
cedure described in example 1. In two independent experiments, protoplasts of plant line 104 were cocultivated with 
an Agrobacterium strain harbouring the binary vector pSDM101. The plasmid pSDM101 contains a NPTII gene with 
a 5' deletion, next to the hygromycin resistance marker (figure 9). The transformation experiments resulted in 285 and 
35 281 kanamycin resistant calli, respectively. In most of these calli gene targeting had not occurred. Results which are 
not shown here suggested that the 5' deleted NPTII gene at the repair T-DNA had been fused to an endogenous plant 
gene. 

In recent articles on homologous recombination in animal cells the polymerase chain reaction (PCR) technique is used 
for quick detection of a homologous recombination event (Kim & Smithies, 1988; Zimmer and Gruss, 1989). Also in 
40 our experiments we used the PCR method to screen for kanamycin resistant calli in which an intact NPTII gene had 
been formed via homologous recombination (Figure 9). A PCR with two primers that anneal within the regions deleted 
in either the target NPTII gene or the repair construct should result in amplification (of a 979 bp size fragment) only if 
an intact NPTII gene is present. In this way a total number of 21 3 calli was screened. Three calli appeared to be PCR- 
positive and plants were regenerated from these calli resulting in plant lines 1, 2 and 3, respectively. 

45 

Example 8 

Molecular analysis to detect targeting events. 

so The chromosomal DNA of the plant lines was analysed using the Southern blot method described in example 5. 

DNA was cut with the restriction enzymes EcoRI/Bc] I and Hindi II, the fragments were separated on gel and subse- 
quently transferred to Hybond-N membrane. Hybridisation was performed with an internal NPTII probe (the 610 bp 
Xmalll/Rsrll fragment, see figure 8). 

In plant line 1, one of the 3' deletion mutant copies of the NPTII gene present on the target locus had been restored 
55 by the incoming T-DNA via homologous recombination. Wildtype NPTII activity could be detected in leaves of this plant 
line (Piatt and Yang, 1987), thereby confirming the presence of an intact NPTII gene. 
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Example 9 

Construction of the SSU-fusion genes 

5 In order to isolate an active member of the SSU multigene family a subgenomic library of N.tabacum SR1 was 

constructed in lambda phage PDJII (Maniatis etal ., 1982). Using a probe specific for a SSU cDNA-clone of N.tabacum 
cv. petit havana SR1, a clone containing a SSU-gene with 3 introns (Mazur et a]., 1 985) was isolated from this library. 
The active gene is located on a 2.4 Kb basepair Hindi 1 1 fragment. Both the restriction map and the DNA-sequence of 
the clones isolated by us corresponded with the published data (Mazur etaj., 1985). The Hindlll fragment was cloned 

10 in plC19H (Marsh et a]., 1984) resulting in plasmid pSIC1. This gene was used for the construction of chimaeric SSU- 
NPTII genes (figure 10). To do this a NPTII insertion module was constructed (clone pSDM53) by: 

a) replacing the Eco RI /Xma lll fragment of pSDM4 (see figure 1) containing the nopaline synthase promoter and 
the start of the coding region by a 51 basepair synthetic Eco RI /Xma lll fragment (see figure 1 0B) on which a unique 

*5 Bam HI and a unique BgJII restriction site are located. 

b) removing both Pstl restriction sites from the NPTII-sequence (Beck et aj., 1982). The Pstl site in the coding 
region was removed by changing a G into an A on position 1733 in the Tn5 sequence (Beck et a]., 1982), using 
M13/oligonucleotide mutagenesis (mutagenesis kit from Biorad). Consequently, no changes were introduced in 
the amino acid sequence of the NPTII protein. The other Pstl site is located outside the coding region and was 

20 removed by cutting with Pstl, blunt-ending with Klenow in the presence of dCTP, subsequently cutting with Sma l 

and closing by ligation. Due to this the fragment ranging from base 251 9 till base 2656 in the Tn5 sequence (Beck 
et aj., 1982) of the 3' non-coding region was deleted. 

For the construction of one fusion gene the 1 ,7 Kb BamHI fragment of the promoterless NPTII and the 3' OCS terminator 
25 was inserted into a Bam HI site of the rbcS-gene on pSIC1 . The resulting plasmid pNS1 contains a translational fusion 
between the rbcS and the NPTII in the fourth exon of the rbsS gene. 

For the construction of the other SSU-NPTII fusion in the second exon, in pSIC1 the Bam HI site was removed from 
the fourth exon by filling in with Klenowpolymerase This resulted in plasmid pSIC2. Subsequently, in the second exon 
of pSIC2 a new Bam HI site was introduced using M13/oligonucleotide mutagenesis (mutagenesis kit of Biorad). Due 
30 to this a G was changed into a T and an A into a C, respectively on positions 1 383 and 1 385 in the sequence of Mazur 
et al ., (1984). In the resulting plasmid pSIC3, the 1,7 Kb Bglll /Bam HI fragment from pSDM53 was cloned into the new 
Bam HI restriction-site. The clone pNS2 thus obtained contains a translational fusion between the rbcS and the NPTII 
in the second exon of the rbcS-gene. 

35 Example 10 

Construction of the binary vector pSDM14 

The plasmid pSDMlO (see figure 7) served as a basis for the construction of the binary vector pSDM1 4. Synthetic 
40 borders and a fragment containing the overdrive sequences were transferred to pSDM1 0 by the following method. The 
overdrive sequence of pTiAch5 is located on a Bcll /Sac l fragment (14087-14710, Barker et al., 1983). This fragment 
was cloned in plC20R (Marsh et al., 1984) cut with Sac I and Bam HI. From p!C20R, the 'overdrive' was cloned as a 
Sac l /Eco RI fragment to pUC1 9 (xSaclxEcoRI). The resulting plasmid was digested with Sac l and Kpn l and a synthetic 
Kpn l /Sac l fragment containing the right T-DNA border was ligated into it. By cutting the resulting plasmid with Hindlll 
45 and Kpn l a Hindlll /Kpn l fragment containing the left T-DNA border could be cloned. From this plasmid pBINSB2 the 
EcoRI/Hindlll fragment was excised. The ends of the fragment were filled in with Klenow-polymerase, BgJII -linkers 
were ligated to it, and following digestion with BgJII the fragment was ligated in pSDM10. In this way pSDM14 was 
obtained. 

so Example 11 

Cloning of the rbcS-NPTII fusion genes in pSDM14 

From pNS1 and pNS2 the Hindlll fragments with the intact fusion genes and the Pstl/Hindlll fragments with the 
55 promoterless rbcS-NPTI I fusions were cloned into plC20R (Marsh etaj., 1984), resulting in plasmids pNTSSl, 2, 3, 
and 4 respectively (see figure 10). From there, the fusion genes were ligated as Sal l /Xho l fragments in the binary 
vector pSDM14 (figure 11) that had been cut with Sall /Xho l. 

The resulting plasmids pNTSSII (A/B), pNTSS12 (A^B), pNTSS21 (A/B), and pNTSS22 (A/B) contain between a syn- 



16 



EP 0 436 007 B1 



thetic right and left border of the octopine Ti-plasmid, the intact 4th. exon fusion gene, the promoteriess 4th. exon fusion 
gene, the intact 2nd. exon fusion gene, and the promoteriess 2nd exon fusion gene, respectively. A and B refer to the 
orientation of the Xho l/Satl fragment in plC20R. In figure 10 only the B orientation is depicted. The binary plasmids 
were crossed to a Aqrobacterium helper strain that contains the Vir-region on a helper Ti-plasmid (example 2), resulting 
s accordingly in strains NTSS1 1 , NTSS1 2, NTSS21 , and NTSS22. 

Example 12 

Site-directed mutagenesis of one of the rbcS-qenes via cocultivation of the protoplasts of Nicotiana tabacum cv. petit 
10 havana SR1 with Aqrobacterium 

In separate transformation experiments 10 7 tobacco protoplasts were cocultivated with the Aqrobacterium strain 
NTSS12 and with the strain NTSS22. These strains respectively contain the promoteriess 4th exon rbcS-NPTII fusion 
gene and the promoteriess 2nd. exon rbcS-NPTII fusion gene between the borders of the T-DNA residing on the binary 

is vector. As positive controls the strains NTSS11 and NTSS21 were cocultivated with 10 6 tobacco protoplasts. The 
cocultivation method used is extensively described in example 1 . Five percent of the protoplasts regenerated to callus 
and from the positive control it appeared that 15% of the calli had been transformed. In the cocultivation experiments 
with NTSS12 and NTSS22 Km r calli were obtained. PCR-analysis was performed on chromosomal DNA of pooled 
tissue of 10 calli (for the method see Lassner et al., 1989). In some calli the transformed T-DNA, which carries the 

20 promoteriess fusion construct had recombined within the target locus, giving rise to a functional fusion between the 
rbcS-promoter and the structural part of the NPTII gene. From these calli plants were regenerated and Southern anal- 
ysis was performed on genomic DNA isolated from these plants. In some of these plants the rbcS-NPTII-part of the T- 
DNA was found to be correctly integrated at the target locus via homologous recombination. 

25 Example 13 

An alternative T-DNA construct to achieve gene targeting to the SSU locus. 

In this part an alternative T-DNA construct to target the SSU locus is described (figure 12). In contrast with the 

30 SSU-NPTII fusion contructs this new construct does not comprise a translational fusion between the SSU gene and 
the coding region of the NPTII gene but has the NPTII coding region under the nopaline synthase promoter inserted 
in the 4th exon of the SSU gene. A small portion of the SSU promoter (±365 bp) is deleted to permit screening for 
recombinants via PCR. In addition the T-DNA aux-2 gene is introduced into the construct as a negative selection marker 
to enrich for recombinant calli. The aux-2 gene product converts ct-naphtalene acetamide (NAM) to the auxin (NAA). 

35 At high concentrations (1 0-20 mg/l) NAM is capable to promote growth of tobacco cells in auxin-free medium. However, 
cells that contain the aux-2 gene will efficiently convert NAM to the more potent auxin NAA which is toxic for plant cells 
at high concentrations (Depicker etaf., 1 988). Thus, by using auxin-free medium containing NAM at a high concentration 
it is possible to select for cells, that do not express the aux-2 gene. 

A 2.5 Kb Hindlll partial of the T-DNA of pTiAch5 (3390 to 5933, Barker, et ai, 1983; Gielen et ai, 1984) which 

40 contains the aux-2 gene was cloned into the Hindlll site of plC20R (Marsch et ai, 1 984). Sequences upstream of the 
aux-2 gene comprising the promoter and part of the coding regon of the aux-1 gene were deleted. This was done by 
digestion with Pstl (in the polylinker of pIC), producing blunt ends by using the exonuclease activity of "Klenow 1 -polymer- 
ase, digestion with Hindi (5721, Barker et ai, 1983; Gielen etai, 1984) and re-ligation of the plasmid. In this way the 
promoter sequences of the aux-2 gene were left intact. Subsequently, the aux-2 gene was cloned as a Sall/Xhol f rag- 

45 ment into the unique Sail site of pNTSS11 B. In the resulting plasmid pNTSS112 the transcription of the aux-2 gene is 
directed towards the left T-DNA border. 

Plasmid pNTSSI was digested with Xhol followed by a partial digestion with Bglll and a 3,9 Kb Xhol/Bglll fragment 
was obtained that contained most of the SSU/NPTII fusion gene exept for 365 bp which were deleted from the 5' end 
of the SSU promoter region. This fragment was cloned into the pUC derivative plCl9H (Marsch etai, 1984) cut with 

so xhol and BamHI . The BamHI insertion module containing the NPTII coding region and the OCS terminator was replaced 
by a Bglll/BamHI fragment which consisted of the NPTII coding region behind the NOS promote rregion without the 
right T-DNA border sequence. The construction of this new insertion module is described below. In the resulting plasmid 
pNTSS5 the transcription of the NPTII gene from the NOS promoter is directed towards the 3' end of the SSU gene 
which can serve as terminator of transcription. In this way the OCS terminator can be omitted which results in a reduction 

55 of the size of the insertion module. The Sall/BamHI fragment of pNTSS5 was isolated and ligated into the vector part 
of pNTSS1 1 2 cut with Sail and BamHI. Thus, the binary vector pNTSS51 2 was obtained which contained the alternative 
targeting construct (see figure 12). 

For the construction of the new NPTII insertion module the Bcll/Smal fragment of pSDM4 containing the NOS 
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promoter and the restored NPTII coding region was cloned into p!C20H (Marsch etai, 1984) cut with Smal and Bglll. 
From the resulting plasmid the fragment was excised with Xhol and BamHI and was recloned in plC19R (Marsch et 
at., 1984) cut with Xhol and BamHI. Finally, the new NPTII insertion module was introduced as a Bglll/BamHI fragment 
into pNTSS5. 

5 The binary vector pNTSS51 2 (figure 1 2) was transferred via triparental mating to an Agrobacterium strain that contains 
the Vir-region on a non-oncogenic helper Ti-plasmid (example 2). 

Example 14 

10 Transformation with T-DNA construct of plasmid PNTSS512. 

Tobacco protoplasts were cocultivated with Agrobacterium strain NTSS51 2. To determine the regeneration and trans- 
formation frequencies protoplasts were regenerated with or without kanamycin as described in example 1 . Non-trans- 
formed tobacco cells were cultivated on NAM containing medium to determine the regeneration frequency on this medium. 

is in a larger experiment protoplasts were cocultivated with strain NTSS512 and grown on auxin-free medium containing 
kanamycin and 10-20 mg/l NAM. Integration of the T-DNA construct via illegitimate recombination in the genome of the 
plant cells resulted in kanamycin resistant and aux-2 + cells. Some of these cells were able to grow on NAM-medium 
because the aux-2 gene was not expressed due to incomplete integration of the T-DNA or inactivation of the gene by 
methylation. In case the NPTII marker was inserted correctly at the target locus via homologous recombination the aux- 

20 2 gene was lost. The resulting cells were resistant to kanamycin and the absence of the aux-2 gene enabled them to 
grow on auxin-free medium containing a high concentration of NAM. 

From the smaller control experiments it was estimated that 5% of the initial protoplasts survived (equal for NAA- and 
NAM- containing medium) and 1 5% of the regenerated calli were resistant to kanamycin. In the large targeting experiment 
the regeneration of protoplasts on NAM-containing medium gave a 10 to 100 fold enrichment for cells that did not express 

25 the aux-2 gene. The PCR technique was used to identify the calli obtained through the desired recombination event. 
Genomic DNA was extracted from pooled tissue of 10 calli (Lassner etal, 1 989) and tested in a PCR. The primers used 
in the reaction respectively annealed in the NPTII coding region and in the upstream region of the SSU promoter that 
was deleted in T-DNA construct pNTSS512. PCR will only result in amplification of a fragment of expected size if recom- 
bination has occurred between incoming T-DNA and target-locus. In fact, in several calli a recombination event was 

30 detected. Plants were regenerated from these calli as described and Southern analysis was performed on these plants 
and on progeny of these plants. In a few plant lines the NPTII module was found to be inserted correctly via homologous 
recombination at the target locus. It was estimated that the gene targeting frequency ranged from 10" 4 to 10' 5 . 

Deposition 

35 

For the purpose of enablement the following E.coli strains have been deposited at the "Centraal Bureau voor 
Schimmelcultures" CBS at Baarn, The Netherlands: (strain Dh5a; genotype: F", endA1 , hsdR1 7 (rk m + k) supE44, thi- 
1 , lambda', recA1 , gyrA96, re]A1 , A(argF-]aczya)U169, phi80dlaczAM15). 
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july 21 1989 


CBS 348.89 pSDM4 








pSDM10 








PSDM200/201 


45 






pSDM7 




pSDM101 






50 


PSDM102 










PSDM104 




pSDM9 


july 21st 1989 


CBS 








346.89 pSDM21 0/211 




pSDM14 


july 21st 1989 


CBS 347.89 pNTSS11 


55 






pNTSS12 






pNTSS21 








pNTSS22 
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E. coli strain useful for DH5a with plasmid construct 


(Date of Deposition) 


(Deposition Number) 








pNTSS512 


5 


pSICl 


july 21st 1989 


CBS 349.89 pNTSSn 








pNTSS12 








pNTSS21 








pNTSS22 


10 






pNTSS512 



15 



35 



45 



The Agrobacterium strain LBA4404, which is a good acceptor strain for all binary plant transformation vectors, has 
already been deposited earlier (Febr. 24th. 1983) and is available via the "Centraal Bureau voor Schimmelcultures" 
(CBS) at Baam, The Netherlands, under number CBS 191.83. 

Legend 

List of symbols and abbreviations used in the figures. 

A double lined box indicates the coding region of the Km r and the Hm r gene (NPTTII and HPT). The 
non-coding region is indicated by a single lined box. 

Fragment comprising the promotor region of the nopaline synthase gene (5'NOS) and the right T- 
DNA border repeat of the nopaline Ti-plasmid. 

3' region of the nopaline synthase gene containing the signal for termination of transcription (3NOS). 
3' region of the octopine synthase gene containing the transcription terminator (3'OCS). 
Promoter region of the 35S transcript of CaMV. 
210bp Eco RI fragment containing the E. coli supF . 
Right T-DNA border repeat 
Left T-DNA border repeat 

(Gene conferring) hygromycin resistance (to plant cells) 
(Gene conferring) kanamycin resistance (to plant cells) 
Bacterial kanamycin resistance gene derived from Streptococcus faecalis 
Bacterial ampicillin resistance gene 

DNA sequence coding for Neomycin PhosphoTransferase II 
DNA sequence coding for Hygromycin PhosphoTransferase 



■ 

Hm r (gene) : 
40 Km r (gene) : 
Kanlli : 
Amp r : 
NPTII : 
HPT: 
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Claims 

55 

1 . A method for introducing a defined mutation in a selected target locus of a nuclear plant genome by integrating at 
least a part of a recombinant DNA into said genome through homologous recombination at the target locus, which 
recombinant DNA has the following general structure, 
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m m □ 

in which the boxes 1 and 7 and the connecting lines represent DNA sequences that are capable of functioning as 
a T-DNA border in the DNA transfer process, 

in which box 1 or 7 may be absent but not both, in which box 3 comprises a DNA sequence which is sufficiently 
homologous to a DNA sequence inside the target locus and sufficiently long to promote homologous recombination, 
in which box 4 represents a DNA sequence that is not homologous to sequences occurring in the target locus, 
and in which the lines connecting the boxes may represent any number of basepairs, or no basepairs, 
and identifying transformants having the defined mutation in said target locus using known methods. 

A method according to claim 1 , in which box 1 , 3 and 7, and the connecting lines are as defined in claim 1 , 
in which box 4 comprises a DNA sequence that is also sufficiently homologous to promote homologous recombi- 
nation with its corresponding sequence in the target locus, thereby integrating a part of itself containing one ore 
more mutations, 

in which the lines connecting the boxes may represent any number of basepairs, or no basepairs. 
A method according to claim 1 or 2, the recombinant DNA having the following general structure, 

in which box 1 , 3 and 7, and the connecting lines are as defined in the above claims, 

in which box 4 represents a DNA sequence that is not homologous to sequences occurring in the target locus, 
in which box 5 represents an expression cassette which is functional in the host, containing a positive selection 
gene, 

in which box E comprises DNA sequences sufficiently homologous to DNA sequences that are located adjacent 
to, or in the vicinity of the target locus in the plant genome and sufficiently long to promote homologous recombi- 
nation. 

A method according to claim 1 or 2, the recombinant DNA having the following general structure, 
in which box 1 , 3 and 7 are as defined in claim 1 , 

in which the boxes 2 or 6 represent an expression cassette which is functional in the host, containing a negative 
selection gene, 

in which box 2 or 6 may be absent but not both. 

A method according to claim 4, the recombinant DNA having the following general structure, 
in which box 1 -7 and E, and the connecting lines are defined as in the above claims, 

in which box E comprises a DNA sequence which is sufficiently homologous and sufficiently long to promote ho- 
mologous recombination. 

A method according to claim 1 or 2, the recombinant DNA having the following general structure, 
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45 



55 



in which the boxes 1 -7 and E, as well as the connecting lines are defined as in any of the above claims. 

7. A method according to any of the claims 4-6, 

in which box 2 and box 6 represent DNA sequences which are not homologous with the target locus, which serve 
as a stuff er sequence inhibiting the expression of the boxes 3 and 4 after undesired recombination, 
in which either box 2 or 6 may be absent but not both, in which box 5 may be absent. 

8. A recombinant DNA capable of integrating a part of itself into the genome of a plant host via homologous recom- 
bination at the target locus, 



m HTTR □ 

15 in which the boxes 1 and 7 represent DNA sequences that are capable of functioning as a T-DNA border in the 

DNA transfer process, 

in which box 1 or 7 may be absent but not both, 

in which both box 3 and 4 represent DNA sequences which are sufficiently homologous and sufficiently long to 
promote homologous recombination with the target locus, 
20 in which the DNA sequences within the boxes have the same 5' to 3' orientation as in the target locus, but the 

order of the boxes themselves have been changed with respect to the situation in the target locus, resulting in 
insertion of box i in the target locus after homologous recombination, in which the lines connecting the boxes may 
represent any number of basepairs, or no basepairs. 

25 9. A recombinant DNA according to claim 8, having the following general structure, 

rTIRRTR RR 



in which the boxes 1 , 3, 4, 7 and i, and the connecting lines are defined as in claim 7, 

in which the boxes 2 or 6 represent an expression cassette which is functional in the host, containing a negative 
selection gene, 

in which box 2 or 6 may be absent but not both. 

1 0. A recombinant DNA according to claim 9, in which the negative selection gene is chosen from the group consisting 
of the aux-2 gene, the cytochrome p450 gene, the Adh gene, and the TK-gene. 

11. A recombinant DNA according to any of the claims 8-10, in which the positive selection gene is selected from the 
group consisting of the NPTII gene, the HPT gene, and the ALS gene. 

12. A vector suitable for cloning in bacteria, comprising a construct according to any of the claims 8-11 . 

13. A vector according to claim 12, further comprising, 

an origin of replication, allowing the vector to replicate in its hosts, 
a functional marker gene, to allow for selection of transformed hosts, 
a polylinker site, for easy manipulating of the vector. 

so 14. Bacteria containing a vector according to claim 12 or 13. 

15. A plant transformation vector, containing a T-DNA construct according to any of the claims 8-11 . 

16. A plant transformation vector according to claim 15, in which this vector is a binary vector 

17. Agrobacteria, or bacteria related thereto, containing a plant transformation vector according to claim 15 or 16. 

18. A process for the preparation of plant cells containing a defined mutation or mutations at a selected target locus 
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of the genome via homologous recombination, comprising the steps of, 

-a) coincubating plant protoplasts or plant parts under transforming conditions, with Agrobacteria or species 
related thereto, containing a plant transformation vector which comprises a construct according to any of the 
claims 8-11, or 

coincubating plant protoplasts or plant parts under transforming conditions while introducing a defined 
mutation by the method according to a process according to any of claims 1 -7, 

-b) identifying the desired cells by screening transformed cells at the DNA-level for the desired recombination 
event, using known methods. 

19. A process for the preparation of plant cells containing a defined mutation or mutations at a selected target locus 
of the genome via homologous recombination, comprising the steps of, 

-a) coincubating plant protoplasts or plant parts under transforming conditions, with Agrobacteria or species 
15 related thereto, containing a plant transformation vector which comprises a construct according to claim 9, or 

coincubating plant protoplasts or plant parts under transforming conditions while introducing a defined 
mutation by the method according to a process according to any of claims 4, 5 or 6, 

-b) selecting or enriching for cells which do not express any of the negative marker genes, under conditions 
allowing for selection, 

20 -c) identify the desired cells by screening transformed cells at the DNA-level for the desired recombination 

event, using known methods. 

20. A process according to claim 1 9, comprising, 

25 - step a) in which the method is that from claim 6, comprising a recombinant DNA having a positive selection 

gene. 

step b) selecting for or enriching for cells which do not express any of the negative marker genes present in 
the construct, and which do express the positive selection gene, under conditions allowing for selection, 

30 21. A process for the preparation of plant cells containing a defined mutation or mutations at a selected target locus 
of the genome via homologous recombination, comprising the steps of, 

-a) coincubating plant protoplasts or plant parts under transforming conditions, with Agrobacteria or species 
related thereto, containing a plant transformation vector which comprises a construct according to claim 8, or 

35 coincubating plant protoplasts or plant parts under transforming conditions while introducing a defined 

mutation by the method according to a process according to claim 7, in which box 3 and 4 comprise a DNA 
sequence for which can be selected or screened for after proper homologous recombination, 
-b) selecting or enriching for cells that do express box 3 and 4, and which do not express the negative selection 
gene, under conditions that allow for these selections. 

40 -c) identify the desired cells by screening transformed cells at the DNA-level for the desired recombination 

event, using known methods. 

22. A process according to claim 18-21 , comprising instead of step a) the following steps, 

45 - transforming bacteria with a vector that is capable of replicating in said bacteria, said vector harboring a re- 

combinant DNA according to any of the claims 8-11 , or a recombinant DNA as has been used in the methods 
according to claims 1 -7, 

culturing said bacteria in medium that allows for growth of the bacteria and replication of said vector, 
isolating said vector from the bacteria suspension, 
50 - transforming plants with that vector or part of that vector using direct DNA transformation methods, 

then step b). 

23. A process according to claim 1 9, in which said vector after isolating from the bacteria suspension is incubated with 
DNA binding proteins, under binding conditions. 
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Patentanspruche 

1. Verfahren zur Einfuhrung einer definierten Mutation in einen ausgewahlten Ziel-Locus eines nuklearen Pflanzen- 
genoms, bei dem mindestens ein Tail einer rekombinanten DNA durch homologe Rekombination an dem Ziel- 
5 Locus in besagtes Genom integriert wird, wobei die rekombinante DNA die folgende allgemeine Struktur hat, 



15 



20 



in der die Kasten 1 und 7 und die verbindenden Linien DNA-Sequenzen darstellen, die zur Funktion als ein 
T-DNA-Rand in dem DNA-Transferprozess befahigt sind, 

in der der Kasten 1 oder 7, nicht jedoch beide abwesend sein konnen, in der der Kasten 3 eine DNA-Sequenz 
umfasst, die genugend homolog zu einer DNA-Sequenz innerhalb des Ziel-Locus und genugend lang zur 
Bewirkung der homologen Rekombination ist, 

in der der Kasten 4 eine DNA-Sequenz darstellt, die nicht homolog zu den in dem Ziel-Locus vorkommenden 
Sequenzen ist und in der die Linien, welche die Kasten verbinden, eine beliebige Zahl von Basenpaaren Oder 
kein Basenpaar darstellen konnen, 

und bei dem die Transformanten mit der definierten Mutation in dem besagten Ziel-Locus unter Verwendung be- 
kannter Methoden identifiziert werden. 



2. Verfahren nach Anspruch 1 , wobei die Kasten 1 , 3 und 7 sowie die Verbindungslinien wie in Anspruch 1 definiert 
25 sind, 

bei dem der Kasten 4 eine DNA-Sequenz umfasst, die ebenfalls genugend homolog ist, um eine homologe Re- 
kombination mit der entsprechenden Sequenz im Ziel-Locus zu bewirken, wodurch sie ein Teil ihrer selbst mit 
einer oder mehreren Mutationen integriert, 

und bei dem die Linien, welche die Kasten verbinden, eine beliebige Zahl von Basenpaaren oder kein Basenpaar 
30 darstellen konnen. 



3. Verfahren nach Anspruch 1 oder 2, wobei die rekombinante Struktur die folgende allgemeine Struktur hat, 



40 



45 



in der die Kasten 1, 3 und 7 und die Verbindungslinien wie in den obigen Anspruchen definiert sind, 

in der der Kasten 4 eine DNA-Sequenz darstellt, die zu den im Ziel-Locus vorkommenden Sequenzen nicht 

homolog ist, 

in der der Kasten 5 eine Expressionskassette darstellt, die im Wirt f unktional ist und ein positives Selektionsgen 
enthalt, 

in der der Kasten E DNA-Sequenzen umfasst, die zu den neben dem Ziel-Locus oder nahe dem Ziel-Locus 
im Pflanzen-genom liegenden Sequenzen genugend homolog und zur Bewirkung der homologen Rekombi- 
nation genugend lang sind. 



4. Verfahren nach Anspruch 1 oder 2, wobei die rekombinante DNA die folgende allgemeine Struktur hat, 



50 



m m r 



55 



in der die Kasten 1 , 3 und 7 wie in Anspruch 1 definiert sind, 

in der die Kasten 2 Oder 6 eine Expressionskassette darstellen, die im Wirt funktional ist und ein negatives 
Selektionsgen enthalt, 

in der der Kasten 2 oder 6, nicht jedoch beide abwesend sein konnen. 
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5. Verfahren nach Anspruch 4, wobei die rekombinante DNA die folgende allgemeine Struktur hat, 



in der die Kasten 1 bis 7 und E sowie die Verbindungsiinien wie in den obigen Anspruchen definiert sind, 
in der der Kasten E eine DNA-Sequenz umfasst, die genugend homolog und genugend lang zur Bewirkung 
einer homologen Rekombination ist. 

6. Verfahren nach Anspruch 1 oder 2, wobei die rekombinante DNA die folgende allgemeine Struktur hat, 



15 



ZL 



in der die Kasten 1 bis 7 und E sowie die Verbindungsiinien wie in einem der obigen Anspruche definiert sind. 

20 7. Verfahren nach einem der Anspruche 4 bis 6, 

bei dem Kasten 2 und Kasten 6 DNA-Sequenzen darstellen, die zum Ziel-Locus nicht homolog sind und als 
Fullsequenz dienen, welche die Expression der Kasten 3 und 4 nach unerwunschter Rekombination unter- 
druckt, 

25 - bei dem entweder Kasten 2 oder 6, nicht aber beide abwesend sein konnen, und bei dem Kasten 5 f ehlen kann. 

8. Rekombinante DNA, die zur Integration eines Teils ihrer selbst in das Pflanzen-Genom mittels homologer Rekom- 
bination beim Ziel-Locus befahigt ist, 



30 
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35 



in der die Kasten 1 und 7 DNA-Sequenzen darstellen, die zur Funktion als ein T-DNA-Rand in dem DNA-Trans- 
ferprozess befahigt sind, 

in der der Kasten 1 oder 7, nicht jedoch beide abwesend sein konnen, 

in der die beiden Kasten 3 und 4 DNA-Sequenzen darstellen, die genugend homolog und genugend lang zur 
Bewirkung der homologen Rekombination mit dem Ziel-Locus sind, in der die DNA-Sequenzen innerhalb der Ka- 
sten dieselbe 5' zu 3'-Orientierung wie beim Ziel-Locus haben, die Reihenfolge der Kasten selber jedoch gegenuber 
der Situation im Ziel-Locus geandert ist, so dass sich die Insertion des Kastens i in den Ziel-Locus nach der ho- 
mologen Rekombination ergibt, und in der die Linien, welche die Kasten verbinden, eine beliebige Anzahl Basen- 
paare oder kein Basenpaar darstellen konnen. 

9. Rekombinante DNA nach Anspruch 8, mit der folgenden aligemeinen Struktur, 



45 



nu 



so jn der die Kasten 1 , 3, 4, 7 und i, sowie die Verbindungsiinien wie in Anspruch 7 definiert sind, 

in der die Kasten 2 oder 6 eine Express ions kassette darstellen, die im Wirt f unktional ist und ein negatives Selek- 
tionsgen enthalt, 

in der der Kasten 2 oder 6, nicht aber beide abwesend sein konnen. 

55 10. Rekombinante DNA nach Anspruch 9, in der das negative Selektionsgen aus der Gruppe bestehend aus dem 
aux-2-Gen, dem Cytochrom p450-Gen, dem Adh-Gen und dem TK-Gen ausgewahlt ist. 



11. Rekombinante DNA gemass einem der Anspruche 8 bis 10, in der das positive Selektionsgen aus der Gruppe 
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bestehend aus dem NPTII-Gen, dem HPT-Gen und dem ALS-Gen ausgewahlt ist. 

12. Zur Klonierung in Bakterien geeigneter Vektor, beinhaltend ein Konstrukt nach einem der Anspruche 8 bis 11. 

5 13. Vektor nach Anspruch 12, beinhaltend des weiteren 

einen Replikationsursprung, der dem Vektor die Replikation in seinen Widen ermoglicht, 
ein funktionales Markergen, urn die Selektion nach transformierten Wirten zu ermoglichen, 
einen Polylinker-Ort zur leichten Handhabung des Vektors. 

10 

14. Bakterien enthaltend einen Vektor nach Anspruch 12 Oder 13. 

15. Pflanzen transformations- Vektor, enthaltend ein T-DNA-Konstrukt nach einem der Anspruche 8 bis 11 . 

is 16. Pflanzen transformations- Vektor nach Anspruch 15, in dem dieser Vektor ein binarer Vektor ist. 

17. Agrobakterien oder damit verwandte Bakterien, enthaltend einen Pflanzentransformations-Vektor nach Anspruch 
15 oder 16. 

20 18. Verfahrenzur Herstel lung von Pflanzenzellen, dieeinedefinierte Mutation oder Mutationen an einem ausgewahlten 
Ziel-Locus des Genoms enthalten, uber homologe Rekombination, beinhaltend die Schritte 

a) Ko-lnkubieren von Pflanzenprotoplasten oder Pflanzenteilen unter transformierenden Bedingungen, mit 
Agrobacteria oder damit verwandten Arten, die einen Pflanzentransformations-Vektor enthalten, der ein Kon- 

25 strukt nach einem der Anspruche 8 bis 1 1 beinhaltet, oder 

Ko-lnkubieren von Pflanzenprotoplasten oder Pflanzenteilen unter transformierenden Bedingungen, wobei 
eine definierte Mutation gemass der Methode des Verfahrens nach einem der Anspruche 1 bis 7 eingefuhrt 
wird, 

b) ldentifizieren der erwunschten Zellen, indem die transformierten Zellen auf DNA-Stufe unter Verwendung 
30 bekannter Methoden auf das erwunschte Rekombinationsereignis Gberpruft werden. 

19. Verfahren zur Herstellung von Pflanzenzellen, die eine definierte Mutation oder Mutationen an einem ausgewahlten 
Ziel-Locus des Genoms enthalten, uber homologe Rekombination, beinhaltend die Schritte 

35 a)Ko-lnkubieren von Pflanzenprotoplasten oder Pflanzenteilen unter transformierenden Bedingungen, mit 

Agrobacteria oder damit verwandten Arten, die einen Pflanzentransformations-Vektor enthalten, der ein Kon- 
strukt nach Anspruch 9 beinhaltet, oder 

Ko-lnkubieren von Pflanzenprotoplasten oder Pflanzenteilen unter transformierenden Bedingungen, wobei 
eine definierte Mutation gemass der Methode des Verfahrens nach einem der Anspruche 4, 5 oder 6 eingefuhrt 
40 wird, 

b) Selektion nach oder Anreicherung von Zellen, die unter Bedingungen, die die Selektion ermoglichen, keines 
der negativen Markergene exprimieren, 

c) ldentifizieren der erwunschten Zellen, indem die transformierten Zellen auf DNA-Stufe unter Verwendung 
bekannter Verfahren auf das erwunschte Rekombinationsereignis Gberpruft werden. 

45 

20. Verfahren nach Anspruch 19, beinhaltend, 

Schritt a), bei dem die Methode diejenige von Anspruch 6 ist und eine rekombinante DNA mit einem positiven 
Selektionsgen verwendet wird, 

so - Schritt b) Selektion nach oder Anreicherung von Zellen, die unter Bedingungen, die die Selektion ermoglichen, 

keines der im Konstrukt vorhandenen negativen Markergene, aber das positive Selektionsgen exprimieren. 

21. Verfahren zur Herstellung von Pflanzenzellen, welche eine definierte Mutation oder Mutationen bei einem ausge- 
wahlten Ziel-Locus des Genoms enthalten, uber homologe Rekombination, beinhaltend die Schritte 

55 

a)Ko-lnkubieren von Pflanzenprotoplasten oder Pflanzenteilen unter transformierenden Bedingungen, mit 
Agrobacteria oder damit verwandten Arten, die einen Pflanzentransformations-Vektor enthalten, der ein Kon- 
strukt nach Anspruch 8 beinhaltet, oder 
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Ko-lnkubieren von Pflanzenprotoplasten Oder Pflanzenteilen unter transformierenden Bedingungen, wobei 
eine definierte Mutation gemass der Methode des Verfahrens nach Anspruch 7 eingefuhrt wird, in dem die 
Kasten 3 und 4 eine DNA-Sequenz beinhalten, nach der, nach ordnungsgemasser homologer Rekombination, 
selektioniert Oder uberpruft werden kann, 
5 b) Selektion nach oder Anreicherung von Zellen, die unter Bedingungen, welche diese Selektionen ermogli- 

chen, die Kasten 3 und 4, nicht aber das negative Selektionsgen exprimieren, 

c)ldentifizieren der erwunschten Zellen, indem die transformierten Zellen auf DNA-Stufe unter Verwendung 
bekannter Verfahren auf das erwunschte Rekombinationsereignis uberpruft werden. 

10 22. Verfahren nach einem der Anspruche 18 bis 21 , beinhaltend anstelle des Schritts a) die folgenden Schritte, 

a)Transformieren von Bakterien mit einem Vektor, der zur Replikation in den besagten Bakterien befahigt ist, 
wobei der besagte Vektor eine rekombinante DNA nach einem der Anspruche 8 bis 11 , oder eine rekombinante 
DNA wie sie in den Verfahren nach einem der Anspruche 1 bis 7 verwendet wurde, einschliesst, 
'5 b)Kultivieren der besagten Bakterien in einem Medium, das das Wachstum der Bakterien und die Replikation 

des besagten Vektors ermoglicht, 

c) lsolierung des besagten Vektors aus der Bakteriensuspension, 

d) Transformieren von Pflanzen mit diesem Vektor oder einem Teil dieses Vektors unter Verwendung von Me- 
thoden der direkten DNA-Transformation. 

20 e) dann Schritt b). 

23. Verfahren nach Anspruch 1 9, in dem der besagte Vektor nach der Isolation von den Bakterien unter Bedingungen, 
die die Bindung ermoglichen, mit DNA-bindenden Proteinen inkubiert wird. 
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Revendications 

1. Proc6d6 d'introduction d'une mutation d6finie dans un locus cible s6lectionn6 d'un g6nome v6g6tal nucl6aire par 
integration d'au moins une partie d'un ADN recombinant dans ledit g6nome a I'aide d'une recombinaison homo- 
logue au niveau du locus cible, lequel ADN recombinant a la structure g6n6rale suivante, 



HL 



35 dans laquelle les boltes 1 et 7 et les lignes de connexion repr6sentent des sequences d'ADN qui sont ca- 

pables de fonctionner comme une bordure d'ADN -T dans le processus de transfert d'ADN, 

dans laquelle la boTte 1 ou 7 peut etre absente mais pas les deux a la fois, dans laquelle la botte 3 comports 

une sequence d'ADN qui est suffisamment homologue a une sequence d'ADN situ6e a l'inte>ieur du locus cible 

et qui est suffisamment longue pour faciliter une recombinaison homologue, 
40 dans laquelle la boTte 4 represente une sequence d'ADN qui n'est pas homologue aux sequences survenant 

dans le locus cible, et dans laquelle les lignes de connexion des boTtes peuvent repr6senter n'importe quel nombre 

de paires de bases, ou aucune paire de bases, 

et par identification des transformants ayant la mutation definie dans ledit locus cible en utilisant des proced6s 

connus. 

45 

2. Proc6d6 selon la revendication 1 , dans lequel les boTtes 1 , 3 et 7, et les lignes de connexion sont comme d^finies 
dans la revendication 1 , 

dans lequel la boTte 4 comporte une sequence d'ADN qui est 6galement suffisamment homologue afin de 
faciliter une recombinaison homologue avec sa sequence correspondante situ6e dans le locus cible, integrant de 
50 ce fait une partie d'elle-meme contenant une ou plusieurs mutations, 

dans lequel les lignes de connexion des boTtes peuvent reprdsenter n'importe quel nombre de paires de 
bases, ou aucune paire de bases. 



3. ProcedS selon la revendication 1 ou 2, I'ADN recombinant ayant la structure g6n6rale suivante, 

m m~ii m m r 
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dans laquelle les boTtes 1, 3 et 7, et les lignes de connexion sont comme d^finies dans les revendications 
ci-dessus, 

dans laquelle la boite 4 represente une sequence d'ADN qui n'est pas homologue a des sequences sun/enant 
dans le locus cible, 

dans laquelle la boTte 5 represente une cassette d'expression qui est fonctionnelle chez Phote, contenant un 
gene de selection positive, 

dans laquelle la boTte E comporte des sequences d'ADN suffisamment homologues a des sequences d'ADN 
qui sont localisees de maniere adjacente a, ou au voisinage du locus cible dans le genome de la plante et suffi- 
samment longues afin de faciliter une recombinaison homologue. 

ProcedS selon la revendication 1 ou 2, I'ADN recombinant ayant la structure gen6rale suivante, 

JL 



dans laquelle les boites 1 . 3 et 7 sont comme d6finies dans la revendication 1 , 

dans laquelle les boTtes 2 ou 6 represented une cassette d'expression qui est fonctionnelle chez I'hdte, 
contenant un gene de selection negative, 

dans laquelle la boTte 2 ou 6 peut etre absente mais pas les deux a la fois. 

Proc6d§ selon la revendication 4, I'ADN recombinant ayant la structure g6n6rale suivante, 



JZL 



FFl E R Q L 



dans laquelle les boites 1 a 7 et E, et les lignes de connexion sont definies comme dans les revendications 
ci-dessus, 

dans laquelle la boite E comporte une sequence d'ADN qui est suffisamment homologue et suffisamment 
longue pour faciliter une recombinaison homologue. 

Procede selon la revendication 1 ou 2, I'ADN recombinant ayant la structure gendrale suivante, 



3 4 



dans laquelle les boites 1 a 7 et E, ainsi que les lignes de connexion sont decides selon Tune quelconque 
des revendications precSdentes. 

Proc6d6 selon Tune quelconque des revendications 4 a 6, 

dans lequel la boite 2 et la boite 6 represented des sequences d'ADN qui ne sont pas homologues au locus 
cible, qui servent de sequences de remplissage inhibant I'expression des boites 3 et 4 apres une recombinaison 
non-voulue, 

dans lequel la boite 2 ou la boite 6 peut etre absente mais pas les deux a la fois, dans lequel la boite 5 peut 
etre absente. 

ADN recombinant capable d'integrer une partie de lui-meme dans le g£nome d'un note vegetal via une recombi- 
naison homologue au niveau du locus cible, 



HI 



3 | x | 4 



HI 



dans lequel les boites 1 et 7 represented des sequences d'ADN qui sont capables de fonctionner comme une 
bordure d'ADN-T dans le processus de transfert d'ADN, 

dans lequel la boite 1 ou 7 peut etre absente mais pas les deux a la fois, 

dans lequel a la fois la boite 3 et la boite 4 represented des sequences d'ADN qui sont suffisamment ho- 
mologues et suffisamment longues afin de faciliter une recombinaison homologue avec le locus cible, 
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dans lequei les sequences d'ADN situ6es au sein des boites ont la mdme orientation 5' vers 3' que dans le 
locus cible, mais I'ordre des boites elles-m§mes a 6X6 change par rapport a la situation dans le locus cible, ayant 
pour resultat une insertion d'une boite i dans le locus cible apres une recombinaison homologue, dans lequei les 
lignes de connexion des boites peuvent repr6senter n'importe quel nombre de paires de bases, ou aucune paire 
5 de bases. 

9. ADN recombinant selon la revendication 8, ayant la structure g6ne>ale suivante, 



ph r 



dans laquelle les boites 1 , 3, 4, 7 et i, et les lignes de connexion sont definies comme dans la revendication 7, 
dans laquelle les boites 2 ou 6 represented une cassette d'expression qui est fonctionnelle chez I'hote, 
contenant un gene de selection negative, 

dans laquelle la boTte 2 ou 6 peut §tre absente mais pas les deux a la fois. 



10. ADN recombinant selon la revendication g, dans lequei le gene de selection negative est choisi parmi le groupe 
constitue du gene aux-2, du gene du cytochrome p450, du gene d'Adh, et du gene de TK. 

11. ADN recombinant selon Tune quelconque des revendications 8 a 10, dans lequei le gene de selection positive est 
selectionne parmi le groupe constitue du gene de NPTII, du gene de HPT, et du gene d'ALS. 

12. Vecteur approprie au clonage dans des bacteries, comportant une construction selon Tune quelconque des re- 
vendications 8 a 1 1 . 



13. Vecteur selon la revendication 12, comportant de plus, 



une origine de replication, lui permettant de se repliquer dans son note, 
30 - un gene marqueur fonctionnel, afin de permettre une selection des notes transformed, 

un site d'agent de liaisons multiples, pour une manipulation aisee du vecteur. 



14. BactGrie contenant un vecteur selon la revendication 12 ou 13. 



35 15. Vecteur de transformation veg6tale, contenant une construction d'ADN-T selon Tune quelconque des revendica- 
tions 8 a 1 1 . 



16. Vecteur de transformation veg6tale selon la revendication 15, dans laquelle ce vecteur est un vecteur binaire. 



40 17. Agrobacterie, ou bacterie apparentee a celle-ci, contenant un vecteur de transformation vegetale selon la reven 
dication 15 ou 16. 



18. Processus de preparation de cellules v6g6tales contenant une mutation ou des mutations definies au niveau d'un 
locus cible selectionne du g6nome via une recombinaison homologue, comportant les dtapes consistant a, 

45 

a) mettre a co-incuber des protoplastes de plantes ou des parties de plantes dans des conditions transfor- 
mantes, avec Agrobacteria ou avec des especes apparentees a celle-ci, contenant un vecteur de transforma- 
tion vegetale qui comporte une construction selon Tune quelconque des revendications 8 a 11, ou 

mettre a co-incuber des protoplastes de plantes ou des parties de plantes dans des conditions transfor- 
so mantes, tout en introduisant une mutation dSfinie par le proc6de selon un processus selon I'une quelconque 

des revendications 1 a 7, 

b) identifier les cellules voulues en criblant les cellules transformers au niveau de I'ADN en ce qui concerne 
I'6v6nement de recombinaison voulue, en utilisant des proceeds connus. 



19. Processus de preparation de cellules vegStales contenant une mutation ou des mutations ddfinies au niveau d'un 
locus cible s§lectionn6 du genome via une recombinaison homologue, comportant les 6tapes consistant a, 

a) mettre a co-incuber des protoplastes de plantes ou des parties de plantes dans des conditions transfor- 
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mantes, avec Agrobacteria ou avec des especes apparentSes a celle-ci, contenant un vecteur de transforma- 
tion vdgetale qui comporte une construction selon la revendication 9, ou 

mettre a co-incuber des protoplastes de plantes ou des parties de plantes dans des conditions transfor- 
mantes, tout en introduisant une mutation definie par le precede selon un processus selon Tune quelconque 
5 des revendications 4, 5 ou 6, 

b) selectionner les cellules ou enrichir en cellules qui n'expriment aucun des genes marqueurs negatifs, dans 
des conditions permettant la selection, 

c) identifier les cellules voulues en criblant les cellules transformees au niveau de I'ADN en ce qui concerne 
I'ev6nement de recombinaison voulue, en utilisant des precedes connus. 

10 

20. Processus selon la revendication 1 9, comportant, 

I'etape a) dans laquelle le procede est celui de la revendication 6, comportant un ADN recombinant ayant un 
gene de selection positive, 

*5 - I'etape b) de selection des cellules ou d'enrichissement en cellules qui n'expriment aucun des genes marqueurs 

negatifs presents dans la construction, et qui expriment le gene de selection positive, dans des conditions 
permettant la selection. 

21 . Processus de preparation de cellules veg6tales contenant une mutation ou des mutations definies au niveau d'un 
20 locus cible du genome via une recombinaison homologue, comportant les etapes consistant a, 

a) mettre a co-incuber des protoplastes de plantes ou des parties de plantes dans des conditions transfor- 
mantes, avec Agrobacteria ou avec des especes apparentees a celle-ci, contenant un vecteur de transforma- 
tion vegetale qui comporte une construction selon la revendication 8, ou 

25 mettre a co-incuber des protoplastes de plantes ou des parties de plantes dans des conditions transfor- 

mantes, tout en introduisant une mutation definie par le procede selon un processus selon la revendication 
7, dans lequel les boites 3 et 4 component une sequence d'ADN qui peut etre selectionnee ou criblee apres 
une recombinaison homologue correcte, 

b) selectionner les cellules ou enrichir en cellules qui expriment les bottes 3 et 4, et qui n'expriment pas le 
30 gene de selection negative, dans des conditions qui permettent ces selections, 

c) identifier les cellules voulues en criblant les cellules transformees au niveau de I'ADN en ce qui concerne 
Pevenement de recombinaison voulue, en utilisant des precedes connus. 

22. Processus selon les revendications 18 a 21 , comportant au lieu de I'etape a) les etapes suivantes consistant a, 

35 

transformer des bacteries avec un vecteu r qui est capable de se repliquer dans lesdites bacteries, ledit vecteur 
abritant un ADN recombinant selon Tune quelconque des revendications 8 a 11, ou un ADN recombinant tel 
que celui utilise dans les precedes selon les revendications 1 a 7, 

cultiver lesdites bacteries dans un milieu qui permet la croissance des bacteries et la replication dudit vecteur, 
40 - isoler ledit vecteur de la suspension bacterienne, 

transformer les plantes avec ce vecteur ou avec une partie de ce vecteur en utilisant des precedes de trans- 
formation d'ADN directe, 
puis etape b). 

45 23. Processus selon la revendication 19, dans lequel ledit vecteur apres isolement a partir de la suspension bacte- 
rienne est mis a incuber avec des proteines de liaison de I'ADN, dans des conditions de liaison. 
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Fig. 1 
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